Abstract

Generalized relative entropy, monotone Riemannian metrics, geodesic distance, and 
trace distance are all known to decrease under the action of quantum channels. We 
give some new bounds on, and relationships between, the maximal contraction for these 
quantities. 
1 Introduction

It is well known that many quantities of interest in quantum information theory contract
under the action of completely positive and trace-preserving (CPT) maps, which represent
quantum channels, including the relative entropy $H(P,Q) = \mathrm{Tr}\,P(\log P - \log Q)$ of two
positive definite operators with $\mathrm{Tr}\,P = \mathrm{Tr}\,Q$. When $\Phi$ is a quantum channel, we can define

$$\eta^{\mathrm{RelEnt}}(\Phi) = \sup\left\{ \frac{H(\Phi(P),\Phi(Q))}{H(P,Q)} : P \ne Q,\ \mathrm{Tr}\,P = \mathrm{Tr}\,Q \right\}, \tag{1}$$

which describes the maximal contraction under $\Phi$. Another contraction coefficient can be
defined with respect to the trace distance as $\eta^{\mathrm{Tr}}(\Phi) = \sup \|\Phi(P-Q)\|_1/\|P-Q\|_1$, where the
supremum is taken over $P, Q$ as above. This can be regarded as the quantum version of the
Dobrushin coefficient of ergodicity [12].

The concept of contraction coefficient was defined in the classical case m and generalized 
to the quantum setting in [38]. Similar definitions can be given to describe the contraction 
of many other quantities. We consider here primarily contraction with respect to quantum 
divergences (a special case of quasi-entropies), monotone Riemannian metrics and geodesic 
distances arising from them. There are many relations between the contraction coefficients 
of these quantities, which are our main concern in this paper. We also study the dependence 
of these contraction coefficients on the particular operator convex functions used to define 
them. Both classical and quantum contraction coefficients have important applications to the 
problem of mixing time bounds of (quantum) Markov processes and in particular, (quantum)
Markov chains, as demonstrated in, e.g., [TOIEIIIMIEQ]. 

We first recall what is known in the classical setting. A classical channel from $\mathbb{R}^d$ to $\mathbb{R}^{d'}$
can be represented by a $d' \times d$ column-stochastic matrix $A$. The trace-norm (or Dobrushin)
contraction coefficient is $\eta^{\mathrm{Tr}}(A) = \sup \|Ax\|_1/\|x\|_1$, where the supremum is taken
over non-zero $x \in \mathbb{R}^d$ with $\sum_i x_i = 0$. On the Riemannian manifold $\mathcal{P}_d$ of probability vectors
$p = (p_1,\dots,p_d)$, $p_i > 0$, $\sum_{i=1}^d p_i = 1$, the so-called Fisher-Rao metric is the unique classical
monotone Riemannian metric, for which we have the Riemannian contraction coefficient
$\eta^{\mathrm{Riem}}(A)$. When $g$ is a strictly convex function on $(0,\infty)$ with $g(1) = 0$, the classical
$g$-divergence extending the classical relative entropy is defined as $H_g(p,q) = \sum_{i=1}^d g(p_i/q_i)\,q_i$
for $p,q \in \mathcal{P}_d$, for which we have the contraction coefficient $\eta_g^{\mathrm{RelEnt}}(A)$. The following relations
between these contraction coefficients were proved in mm- 

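$$\eta_g^{\mathrm{RelEnt}}(A) = \eta^{\mathrm{Riem}}(A) \le \eta^{\mathrm{Tr}}(A) \le \sqrt{\eta^{\mathrm{Riem}}(A)} \tag{2}$$

whenever $g$ is operator convex on $(0,\infty)$.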
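
As a small numerical illustration of the classical trace-norm coefficient, the sketch below evaluates it for a column-stochastic matrix. It relies on a standard identity that is not stated in the text above (it is the classical counterpart of the rank 1 projection formula (35) given later): the supremum over zero-sum vectors is attained on differences of two basis vectors, so the coefficient equals half the largest $\ell_1$ distance between two columns. The function name `dobrushin_coefficient` and the example matrices are illustrative choices only.

```python
import numpy as np

def dobrushin_coefficient(A):
    """Trace-norm (Dobrushin) coefficient of a column-stochastic matrix A,
    computed as (1/2) * max_{j,k} || A e_j - A e_k ||_1."""
    A = np.asarray(A, dtype=float)
    d = A.shape[1]
    best = 0.0
    for j in range(d):
        for k in range(j + 1, d):
            best = max(best, 0.5 * np.abs(A[:, j] - A[:, k]).sum())
    return best

identity = np.eye(3)                    # no contraction at all
constant = np.full((3, 3), 1.0 / 3.0)   # every input goes to the uniform vector
mixed = 0.5 * identity + 0.5 * constant # halfway between the two

eta_identity = dobrushin_coefficient(identity)
eta_constant = dobrushin_coefficient(constant)
eta_mixed = dobrushin_coefficient(mixed)
```

The pairwise loop costs $O(d^2 d')$ operations, which is fine for small examples; nothing here depends on the quantum setting.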

In the quantum setting, the study of monotone Riemannian metrics on the manifold $\mathcal{D}_d$
of $d \times d$ positive definite density matrices was begun by Morozova and Cencov [42]. Petz
[46] then showed that there were infinitely many such metrics, corresponding to positive
operator monotone functions on $(0,\infty)$. Following [22, 38] we use the set $\mathcal{K}$ of operator convex
functions $\kappa > 0$ on $(0,\infty)$ with $\kappa(1) = 1$ and $x\kappa(x) = \kappa(x^{-1})$ to parametrize symmetric
monotone metrics on $\mathcal{D}_d$, $d \in \mathbb{N}$. Such $\kappa$ functions correspond one-to-one, by $\kappa = 1/f$, to
operator monotone functions $f > 0$ on $(0,\infty)$ with $f(1) = 1$ and $f(x) = x f(x^{-1})$, giving
the same family of such metrics as in [46]. Thus, for each $\kappa$ function we can define the
Riemannian contraction coefficient $\eta_\kappa^{\mathrm{Riem}}(\Phi)$ of a channel $\Phi$ from the $d \times d$ matrix algebra
$\mathbb{M}_d$ to $\mathbb{M}_{d'}$, and do so explicitly in (19) of Section 2.4. On the other hand, for each operator
convex function $g$ on $(0,\infty)$ with $g(1) = 0$ and $g''(1) > 0$ we have the quantum $g$-divergence
$H_g(\rho,\gamma)$ for $\rho,\gamma \in \mathcal{D}_d$ and the corresponding contraction coefficient $\eta_g^{\mathrm{RelEnt}}(\Phi)$, as defined in
(12) and (15) of Section 2.3. As shown in [38], among other references, and developed further here, the
relation between $\eta_\kappa^{\mathrm{Riem}}(\Phi)$ and $\eta_g^{\mathrm{RelEnt}}(\Phi)$ and their dependence on the $\kappa$ and $g$ functions in
the quantum setting are not as simple as in the classical setting.

This paper is organized as follows. In Section 2 precise definitions of quantum $g$-divergences
and monotone metrics parametrized by the $\kappa$ functions are given, for which we introduce the
contraction coefficients $\eta_g^{\mathrm{RelEnt}}(\Phi)$ and $\eta_\kappa^{\mathrm{Riem}}(\Phi)$. Section 3 provides familiar examples of
$g$-divergences and monotone metrics such as the BKM, the Wigner-Yanase, and the Bures
metrics. In Section 4 a description of $\eta_\kappa^{\mathrm{Riem}}(\Phi)$ in terms of a certain eigenvalue problem developed
in [38] is recalled, which establishes the relation $\eta_\kappa^{\mathrm{Riem}}(\Phi) \le \eta^{\mathrm{Tr}}(\Phi)$ when $\kappa(x) = x^{-1/2}$.

The main results in Section 5 are the general relations

$$\eta_\kappa^{\mathrm{Riem}}(\Phi) = \eta_\kappa^{\mathrm{geod}}(\Phi) \le \eta_g^{\mathrm{RelEnt}}(\Phi), \qquad \eta^{\mathrm{Tr}}(\Phi) \le \sqrt{\eta_\kappa^{\mathrm{Riem}}(\Phi)} \tag{3}$$

when $g$ is related to $\kappa$ by $g(x) = (x-1)^2\kappa(x)$. Here $\eta_\kappa^{\mathrm{geod}}(\Phi)$ is the contraction with respect
to the geodesic distance induced by the monotone metric for $\kappa$ as defined in (21). A lemma
slightly modified from [25] is given in Appendix A to prove the equality in (3). The first
inequality in (3) was given in [35]; the second is proved in Section 5.1, strengthening results
from [50, 48]. In Section 5.2, a partial ordering of contraction coefficients is shown to hold
when the domain or range of the channel is a commutative subalgebra, i.e., classical. Some
remarks on extensions to weak Schwarz maps are given in Section 5.3.

In Section 6 we treat qubit channels using the Bloch sphere representation. The section
includes proofs of statements announced in [38], as well as additional results. In particular
when $\Phi_T$ is a unital qubit channel described by a $3 \times 3$ matrix $T$ as in [34] we prove that

Next, for $\Phi$ in the simplest possible family of non-unital qubit channels (which collapse the
Bloch sphere to a line), we estimate $\eta_\kappa^{\mathrm{Riem}}(\Phi)$ for several particular cases of $\kappa$ as well as
$\eta_g^{\mathrm{RelEnt}}(\Phi)$ for special $g$ corresponding to the extreme $\kappa$ functions. These estimates suffice to
show that the equality conditions in (3) above do not extend to non-unital channels; that
the contraction coefficients depend on the functions $\kappa$ and $g$; and that several natural conjectures
are false. Complete proofs of the results in Section 6, which are elementary but somewhat
lengthy, are given in Appendix B.

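Finally in Section 7 we present further results on contraction coefficients for some special
examples of Section 3. A remarkable result here is that the equality $\eta_{\mathrm{BKM}}^{\mathrm{Riem}}(\Phi) = \eta_{\mathrm{BKM}}^{\mathrm{RelEnt}}(\Phi)$
holds for every channel $\Phi$, where $\eta_{\mathrm{BKM}}^{\mathrm{Riem}}(\Phi)$ denotes the BKM metric contraction and $\eta_{\mathrm{BKM}}^{\mathrm{RelEnt}}(\Phi)$ the
contraction with respect to the symmetrized relative entropy $H(P,Q) + H(Q,P)$. But the
equality between $\eta^{\mathrm{RelEnt}}(\Phi)$ and $\eta_{\mathrm{BKM}}^{\mathrm{Riem}}(\Phi)$ is open.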

2 Notation and Definitions 

2.1 Basic notation 

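For each $d \in \mathbb{N}$ we write $\mathbb{M}_d$, $\mathbb{H}_d$, $\mathbb{P}_d$, and $\overline{\mathbb{P}}_d$ for the sets of $d \times d$ complex, Hermitian,
positive definite, and positive semi-definite matrices, respectively. We also denote by $\mathcal{D}_d$ the
set of $d \times d$ positive definite density matrices and by $\overline{\mathcal{D}}_d$ the set of all $d \times d$ density matrices,
i.e., $\mathcal{D}_d = \{\rho \in \mathbb{P}_d : \mathrm{Tr}\,\rho = 1\}$ and $\overline{\mathcal{D}}_d = \{\rho \in \overline{\mathbb{P}}_d : \mathrm{Tr}\,\rho = 1\}$, where $\mathrm{Tr}$ is the usual trace
functional on $\mathbb{M}_d$. The trace-norm of $A \in \mathbb{M}_d$ is $\|A\|_1 = \mathrm{Tr}\,|A|$. Recall that $\mathbb{M}_d$ identified
with $B(\mathbb{C}^d)$ becomes a Hilbert space when equipped with the Hilbert-Schmidt inner product

$$\langle X, Y\rangle = \mathrm{Tr}\,X^*Y, \qquad X, Y \in \mathbb{M}_d,$$

together with the Hilbert-Schmidt norm $\|X\|_2 = (\mathrm{Tr}\,X^*X)^{1/2}$. The real subspace $\mathbb{H}_d$ of $\mathbb{M}_d$ is
identified with the Euclidean space of dimension $d^2$, and $\mathcal{D}_d$ is a smooth Riemannian manifold
whose tangent space at any foot point is identified with $\mathbb{H}_d^0 = \{A \in \mathbb{H}_d : \mathrm{Tr}\,A = 0\}$. Functions
$f(A)$ of matrices $A \in \mathbb{P}_d$ are defined via the usual functional calculus.

We will use linear maps $\Phi : \mathbb{M}_d \to \mathbb{M}_{d'}$, and denote by $\widehat{\Phi}$ the adjoint of $\Phi$ with respect to
the Hilbert-Schmidt inner product, i.e., $\langle \Phi(X), Y\rangle = \langle X, \widehat{\Phi}(Y)\rangle$ for all $X \in \mathbb{M}_d$ and $Y \in \mathbb{M}_{d'}$.
As usual in quantum information, we call $\Phi$ a (quantum) channel if $\Phi$ is a CPT (i.e., completely
positive and trace-preserving) map. Most of the maps we consider will be constructed from
the left and right multiplication operators, i.e., $L_A X = AX$ and $R_B X = XB$ for
$A, B, X \in \mathbb{M}_d$. For each $A, B \in \mathbb{P}_d$, $L_A$ and $R_B$ are commuting positive invertible operators
on the Hilbert space $\mathbb{M}_d$ (however, they are not positive in the sense of mapping $\overline{\mathbb{P}}_d$ into $\overline{\mathbb{P}}_d$).
More generally, for functions $f : (0,\infty) \to \mathbb{R}$ we have $L_{f(A)} = f(L_A)$ and $R_{f(B)} = f(R_B)$.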
2.2 Operator convex functions

A real function $f$ on $(0,\infty)$ is said to be operator monotone (or operator monotone increasing)
if $A \ge B$ implies $f(A) \ge f(B)$ for every $A, B \in \mathbb{P}_d$ with any $d \in \mathbb{N}$, and operator monotone
decreasing if $-f$ is operator monotone. A real function $g$ on $(0,\infty)$ is said to be operator
convex if

$$g(\lambda A + (1-\lambda)B) \le \lambda g(A) + (1-\lambda) g(B)$$

for all $A, B \in \mathbb{P}_d$ with any $d \in \mathbb{N}$ and all $\lambda \in (0,1)$, and operator concave if $-g$ is operator
convex. The theory of operator monotone and operator convex functions was initiated by 
Lowner m and Kraus [37], respectively. For details, see, e.g., [U Section V.4], also [UlTSlET]. 

In this work the following classes of operator convex functions play a special role: 


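$$\mathcal{G} = \{g : (0,\infty) \to \mathbb{R},\ \text{operator convex},\ g(1) = 0,\ g''(1) > 0\},$$

$$\mathcal{G}_{\mathrm{sym}} = \{g : (0,\infty) \to [0,\infty),\ \text{operator convex},\ g(x) = x g(x^{-1}) \text{ for } x > 0,\ g(1) = g'(1) = 0,\ g''(1) = 2\},$$

$$\mathcal{K} = \{\kappa : (0,\infty) \to (0,\infty),\ \text{operator convex},\ x\kappa(x) = \kappa(x^{-1}) \text{ for } x > 0,\ \kappa(1) = 1\}.$$

By Proposition 2.2 below there is a one-to-one correspondence $\kappa \in \mathcal{K} \leftrightarrow g \in \mathcal{G}_{\mathrm{sym}}$ determined
by

$$g(x) = (x-1)^2 \kappa(x). \tag{5}$$

It is easy to see that if $g \in \mathcal{G}$ then $\tilde{g}(x) = x g(x^{-1})$ is also in $\mathcal{G}$. Indeed, if $g \in \mathcal{G}$, then
$g(x)/(x-1) = (g(x) - g(1))/(x-1)$ is operator monotone on $(0,\infty)$ by Kraus' theorem, and
hence

$$\frac{\tilde{g}(x) - \tilde{g}(1)}{x-1} = \frac{x g(x^{-1})}{x-1} = -\frac{g(x^{-1})}{x^{-1}-1}$$

is also operator monotone so that $\tilde{g}$ is operator convex on $(0,\infty)$. Moreover, noting that
$\tilde{g}''(1) = g''(1)$, we define the symmetrization of $g$ by

$$g_{\mathrm{sym}} = \frac{g + \tilde{g}}{g''(1)} \in \mathcal{G}_{\mathrm{sym}}. \tag{6}$$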


Proposition 2.1. (i) If $g : (0,\infty) \to \mathbb{R}$ is an operator convex function, then there exist a
unique constant $c \ge 0$ and a unique positive measure $\mu$ on $[0,\infty)$ with $\int_{[0,\infty)} (1+s)^{-1}\,d\mu(s) < +\infty$
such that

$$g(x) = g(1) + g'(1)(x-1) + c(x-1)^2 + \int_{[0,\infty)} \frac{(x-1)^2}{x+s}\,d\mu(s), \qquad x \in (0,\infty). \tag{7}$$

(ii) If $\kappa : (0,\infty) \to \mathbb{R}$ is an operator convex function and it satisfies the normalization
$\kappa(1) = 1$ and the symmetry condition $x\kappa(x) = \kappa(x^{-1})$ for all $x > 0$, then $\kappa(x) > 0$ for all
$x > 0$ (hence $\kappa \in \mathcal{K}$) and there exists a unique probability measure $m$ on $[0,1]$ such that

$$\kappa(x) = \int_{[0,1]} \frac{1+x}{2}\,\frac{(1+s)^2}{(x+s)(1+sx)}\,dm(s) = \int_{[0,1]} \left(\frac{1}{x+s} + \frac{1}{sx+1}\right)\frac{1+s}{2}\,dm(s), \qquad x \in (0,\infty). \tag{8}$$

The integral expression (7) was given in [35] and is a special case of [HI (5.2)]. Since

$$g''(1) = 2\left(c + \int_{[0,\infty)} \frac{1}{1+s}\,d\mu(s)\right), \tag{9}$$

we note that $g''(1) > 0$ if and only if $c + \mu([0,\infty)) > 0$, or equivalently, $g$ is not a linear
function. For the proof of (8), see [22, Appendix A.2]. It is also obvious that $\kappa(x) > 0$ for
all $x > 0$ whenever $\kappa$ is a convex function with $\kappa(1) > 0$ and $x\kappa(x) = \kappa(x^{-1})$ for all $x > 0$.


By Proposition 2.1, $\mathcal{K}$ is a Bauer simplex (in a locally convex topological vector space
consisting of real functions on $(0,\infty)$ in the pointwise convergence topology), whose extreme
points are

$$\kappa_s(x) = \frac{(1+s)^2}{2}\,\frac{1+x}{(x+s)(1+sx)} = \frac{1+s}{2}\left(\frac{1}{x+s} + \frac{1}{1+sx}\right), \qquad 0 \le s \le 1. \tag{10}$$

It is well-known (and immediately seen from the integral expression (8)) that $\kappa_1(x) = 2/(1+x)$
is the smallest element of $\mathcal{K}$ and $\kappa_0(x) = (1+x)/2x$ is the largest in $\mathcal{K}$, so that

$$\frac{2}{1+x} \le \kappa(x) \le \frac{1+x}{2x}, \qquad \kappa \in \mathcal{K}. \tag{11}$$

In the sequel we use the more explicit notations $\kappa_{\min}$ for $\kappa_1$ and $\kappa_{\max}$ for $\kappa_0$.
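
The properties of the extreme points in (10) are easy to check numerically. The following sketch (function names and grids are illustrative choices, not part of the original text) evaluates $\kappa_s$ on a grid and tests the normalization $\kappa_s(1) = 1$, the symmetry $x\kappa_s(x) = \kappa_s(x^{-1})$, and the two-sided bound (11). It is a sanity check on a finite grid, not a proof.

```python
import numpy as np

def kappa_s(x, s):
    """Extreme point kappa_s of the simplex K, first form in (10)."""
    return 0.5 * (1 + s) ** 2 * (1 + x) / ((x + s) * (1 + s * x))

def kappa_min(x):
    return 2.0 / (1.0 + x)

def kappa_max(x):
    return (1.0 + x) / (2.0 * x)

xs = np.logspace(-3, 3, 61)       # sample points in (0, infinity)
ss = np.linspace(0.0, 1.0, 11)    # parameter values in [0, 1]

normalized = all(np.isclose(kappa_s(1.0, s), 1.0) for s in ss)
symmetric = all(np.allclose(xs * kappa_s(xs, s), kappa_s(1.0 / xs, s)) for s in ss)
tol = 1.0 + 1e-12                 # relative slack for floating-point rounding
bounded = all(
    np.all(kappa_min(xs) <= kappa_s(xs, s) * tol)
    and np.all(kappa_s(xs, s) <= kappa_max(xs) * tol)
    for s in ss
)
```

At the endpoints, `kappa_s(x, 1.0)` reproduces $\kappa_{\min}$ and `kappa_s(x, 0.0)` reproduces $\kappa_{\max}$, which is why a small relative tolerance is used in the comparison.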


Proposition 2.2. For a function $\kappa : (0,\infty) \to (0,\infty)$ consider the following conditions:

(a) $\kappa$ is operator convex,

(b) $\kappa$ is operator monotone decreasing,

(c) $g(x) = (x-1)^2\kappa(x)$ is operator convex.

Then (a) $\Leftarrow$ (b) $\Leftrightarrow$ (c). Moreover, if $x\kappa(x) = \kappa(x^{-1})$ for all $x > 0$ or equivalently $g(x) =
x g(x^{-1})$ for all $x > 0$, then the above conditions (a)-(c) are all equivalent.

Proof. For (b) $\Rightarrow$ (a), see [22, Theorem 2.4]. As for (b) $\Leftrightarrow$ (c), Kraus' theorem (see, e.g., [21,
Corollary 2.7.8]) implies that $g$ is operator convex if and only if

$$h(x) = \frac{g(x)}{x-1} = (x-1)\kappa(x), \qquad x > 0,$$

is operator monotone. The latter is also equivalent to the condition that $\kappa(x) = (h(x) -
h(1))/(x-1)$ is operator monotone decreasing. Indeed, this is seen from the facts that $h$ is
operator monotone on $(0,\infty)$ if and only if it has an integral expression

$$h(x) = h(1) + \gamma(x-1) + \int_{[0,\infty)} \frac{x-1}{x+s}\,d\mu(s), \qquad x \in (0,\infty),$$

where $\gamma \ge 0$ and $\mu$ is a positive measure on $[0,\infty)$ with $\int_{[0,\infty)} (1+s)^{-1}\,d\mu(s) < +\infty$ (see [TTl
Theorem 1.9]), and that $\kappa$ is operator monotone decreasing on $(0,\infty)$ if and only if it has an
integral expression

$$\kappa(x) = \gamma + \int_{[0,\infty)} \frac{1}{x+s}\,d\mu(s), \qquad x \in (0,\infty),$$

where $\gamma$ and $\mu$ are the same as above (see m, also [3, Theorem 3.1]).

Next it is immediate to check that the conditions $x\kappa(x) = \kappa(x^{-1})$ and $g(x) = x g(x^{-1})$ for
all $x > 0$ are equivalent. Under this symmetry condition, the implication (a) $\Rightarrow$ (b) follows
immediately from the integral expression (8), as shown in [22, Theorem 2.4]. QED
2.3 Relative entropy or g-divergence

For every $g \in \mathcal{G}$ and every $A, B \in \mathbb{P}_d$ the (quantum) $g$-divergence of $A$ relative to $B$ is defined
by

$$H_g(A,B) = \langle B^{1/2}, g(L_A R_B^{-1}) B^{1/2}\rangle, \tag{12}$$

which is a generalization of the relative entropy and is a special case of quasi-entropies
[Ml nans]. The most important property of $H_g(A,B)$ is the monotonicity

$$H_g(\Phi(A),\Phi(B)) \le H_g(A,B)$$

for every $A, B \in \mathbb{P}_d$ and every CPT map $\Phi : \mathbb{M}_d \to \mathbb{M}_{d'}$. This was first proved by Petz [3ilH5]
under slightly more restricted situations, and the above extension is in [38]; see also [5I1I23].

From the integral expression (7) it is known [38, Theorem II.5] that for every $\rho, \gamma \in \mathcal{D}_d$:

$$H_g(\rho,\gamma) = c\,\mathrm{Tr}\,(\rho-\gamma)^2\gamma^{-1} + \int_{[0,\infty)} \mathrm{Tr}\,(\rho-\gamma)\,\frac{1}{L_\rho + sR_\gamma}(\rho-\gamma)\,d\mu(s). \tag{13}$$

Since $(L_\rho + sR_\gamma)^{-1}$ is a positive invertible operator on $\mathbb{M}_d$, the above expression together
with (9) implies that $H_g(\rho,\gamma) \ge 0$ and that $H_g(\rho,\gamma) = 0$ if and only if $\rho = \gamma$.

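In particular, when $g \in \mathcal{G}_{\mathrm{sym}}$ and $\kappa \in \mathcal{K}$ are given with (5), we note (see [38, Theorem
II.5]) that for every $A, B \in \mathbb{P}_d$,

$$H_g(A,B) = \langle A-B, R_B^{-1}\kappa(L_A R_B^{-1})(A-B)\rangle = \langle A-B, L_A^{-1}\kappa(R_B L_A^{-1})(A-B)\rangle = H_g(B,A).$$

For every $g \in \mathcal{G}$ with symmetrization $g_{\mathrm{sym}}$ in (6) we have

$$H_{\tilde{g}}(A,B) = H_g(B,A), \qquad H_{g_{\mathrm{sym}}}(A,B) = \frac{H_g(A,B) + H_g(B,A)}{g''(1)}. \tag{14}$$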
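
The definition (12) can be evaluated directly from spectral decompositions. Writing $A = \sum_i a_i |u_i\rangle\langle u_i|$ and $B = \sum_j b_j |v_j\rangle\langle v_j|$, the operators $|u_i\rangle\langle v_j|$ are eigenvectors of $L_A R_B^{-1}$ with eigenvalues $a_i/b_j$, which gives $H_g(A,B) = \sum_{i,j} b_j\, g(a_i/b_j)\, |\langle u_i, v_j\rangle|^2$. The sketch below implements this formula and keeps a direct relative-entropy routine for cross-checking. All function names and the two example states are illustrative assumptions, not notation from the paper.

```python
import numpy as np

def g_divergence(g, A, B):
    """H_g(A, B) = sum_{i,j} b_j g(a_i / b_j) |<u_i, v_j>|^2 for positive definite A, B."""
    a, U = np.linalg.eigh(A)
    b, V = np.linalg.eigh(B)
    overlap = np.abs(U.conj().T @ V) ** 2      # entry (i, j) is |<u_i, v_j>|^2
    ratios = a[:, None] / b[None, :]           # eigenvalues of L_A R_B^{-1}
    return float(np.sum(g(ratios) * b[None, :] * overlap))

def matrix_log(M):
    """Logarithm of a positive definite matrix via its eigendecomposition."""
    w, U = np.linalg.eigh(M)
    return (U * np.log(w)) @ U.conj().T

def relative_entropy(rho, gamma):
    """Tr rho (log rho - log gamma), used only as an independent check."""
    return float(np.trace(rho @ (matrix_log(rho) - matrix_log(gamma))).real)

x_log_x = lambda x: x * np.log(x)
minus_log = lambda x: -np.log(x)               # the transpose of x log x
quadratic = lambda x: (x - 1.0) ** 2

# Two non-commuting positive definite qubit states (illustrative values).
rho = np.array([[0.7, 0.1], [0.1, 0.3]], dtype=complex)
gamma = np.array([[0.5, -0.2j], [0.2j, 0.5]], dtype=complex)

h_rel = g_divergence(x_log_x, rho, gamma)
h_swapped = g_divergence(minus_log, rho, gamma)
h_quad = g_divergence(quadratic, rho, gamma)
```

With these inputs, `h_rel` should agree with `relative_entropy(rho, gamma)`, and `h_swapped` with `relative_entropy(gamma, rho)`, which is the first identity in (14) for $g(x) = x\log x$.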


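For each CPT map $\Phi : \mathbb{M}_d \to \mathbb{M}_{d'}$ and each $g \in \mathcal{G}$ we introduce the contraction coefficient
of $\Phi$ with respect to the $g$-divergence $H_g$ by

$$\eta_g^{\mathrm{RelEnt}}(\Phi) = \sup_{\rho,\gamma \in \mathcal{D}_d,\ \rho \ne \gamma} \frac{H_g(\Phi(\rho),\Phi(\gamma))}{H_g(\rho,\gamma)} \quad (\le 1). \tag{15}$$

From (14) we easily see that

$$\eta_{\tilde{g}}^{\mathrm{RelEnt}}(\Phi) = \eta_g^{\mathrm{RelEnt}}(\Phi), \qquad \eta_{g_{\mathrm{sym}}}^{\mathrm{RelEnt}}(\Phi) \le \eta_g^{\mathrm{RelEnt}}(\Phi). \tag{16}$$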


2.4 Riemannian metrics and geodesic distance 

Given a function $\kappa \in \mathcal{K}$ we define, for any $A \in \mathbb{P}_d$, a linear map $\Omega_A^\kappa : \mathbb{M}_d \to \mathbb{M}_d$ by

$$\Omega_A^\kappa(X) = R_A^{-1}\kappa(L_A R_A^{-1})X = L_A^{-1}\kappa(R_A L_A^{-1})X, \qquad X \in \mathbb{M}_d,$$

where the equality of the two expressions follows from $x\kappa(x) = \kappa(x^{-1})$. The positivity
condition in the sense that $\Omega_A^\kappa(\overline{\mathbb{P}}_d) \subset \overline{\mathbb{P}}_d$ (equivalent to complete positivity) for the map $\Omega_A^\kappa$
has been thoroughly investigated in [22], with many examples.

Associated with $\kappa \in \mathcal{K}$ a Riemannian metric $M^\kappa$ on the Riemannian manifold $\mathcal{D}_d$ is defined
by

$$M_\rho^\kappa(A,B) = \langle A, \Omega_\rho^\kappa(B)\rangle, \qquad A, B \in \mathbb{H}_d^0,\ \rho \in \mathcal{D}_d. \tag{17}$$

The Riemannian metrics on $\mathcal{D}_d$ ($d \in \mathbb{N}$) induced by $\kappa \in \mathcal{K}$ are called monotone
metrics since the class was characterized by Petz [46] with the monotonicity property:

$$M_{\Phi(\rho)}^\kappa(\Phi(A),\Phi(A)) \le M_\rho^\kappa(A,A), \qquad A \in \mathbb{H}_d^0,\ \rho \in \mathcal{D}_d \tag{18}$$

for every CPT map $\Phi : \mathbb{M}_d \to \mathbb{M}_{d'}$ with any $d, d'$. Here, note that although $\Phi(\rho)$ in $\overline{\mathcal{D}}_{d'}$ is
not necessarily positive definite, the left-hand side of (18) is well defined by regarding $\Phi(\rho)$
and $\Phi(A)$ as matrices in $\Pi\,\mathbb{M}_{d'}\,\Pi \cong \mathbb{M}_k$, where $\Pi = \mathrm{supp}\,\Phi(I_d)$, the support projection, and
$k = \dim \Pi$. More recent results on monotone metrics are found in [25, 26]. Also, note that if
$\rho$ and $A$ commute, then $\langle A, \Omega_\rho^\kappa(A)\rangle = \mathrm{Tr}\,\rho^{-1}A^2$ independently of the choice of $\kappa \in \mathcal{K}$. This
fact is essentially the same as the classical result that there is only one monotone metric in the
classical setting, known as the Fisher-Rao metric.
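
In an eigenbasis of $\rho$ with eigenvalues $\lambda_i$, the operators $L_\rho$ and $R_\rho$ act on the matrix unit $|i\rangle\langle j|$ by multiplication with $\lambda_i$ and $\lambda_j$, so $\Omega_\rho^\kappa$ multiplies the $(i,j)$ entry of $X$ by $\kappa(\lambda_i/\lambda_j)/\lambda_j$. The sketch below uses this entrywise form; the function names `omega` and `metric`, and the sample $\rho$ and $X$, are illustrative assumptions. The checks that follow the code compare against the closed forms $\frac12(\rho^{-1}X + X\rho^{-1})$ for $\kappa_{\max}$ and $\rho^{-1/2}X\rho^{-1/2}$ for $\kappa(x) = x^{-1/2}$, both of which follow from the definition.

```python
import numpy as np

def omega(kappa, rho, X):
    """Apply Omega_rho^kappa = R_rho^{-1} kappa(L_rho R_rho^{-1}) to X."""
    lam, U = np.linalg.eigh(rho)
    X_eig = U.conj().T @ X @ U                        # X in the eigenbasis of rho
    weights = kappa(lam[:, None] / lam[None, :]) / lam[None, :]
    return U @ (weights * X_eig) @ U.conj().T

def metric(kappa, rho, A, B):
    """Monotone metric M_rho^kappa(A, B) = <A, Omega_rho^kappa(B)> as in (17)."""
    return float(np.trace(A.conj().T @ omega(kappa, rho, B)).real)

kappa_min = lambda x: 2.0 / (1.0 + x)
kappa_central = lambda x: x ** -0.5
kappa_max = lambda x: (1.0 + x) / (2.0 * x)

rho = np.array([[0.6, 0.2], [0.2, 0.4]])                  # positive definite, trace 1
X = np.array([[1.0, 0.5 - 0.3j], [0.5 + 0.3j, -1.0]])     # traceless Hermitian

values = [metric(k, rho, X, X) for k in (kappa_min, kappa_central, kappa_max)]
```

Because $\kappa_{\min} \le x^{-1/2} \le \kappa_{\max}$ pointwise, the three entries of `values` should come out in nondecreasing order for any Hermitian `X`.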

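For each $\kappa \in \mathcal{K}$ the contraction coefficient of a CPT map $\Phi$ with respect to the monotone
metric $M^\kappa$ induced by $\kappa$ is defined by

$$\eta_\kappa^{\mathrm{Riem}}(\Phi) = \sup_{\rho \in \mathcal{D}_d}\ \sup_{A \in \mathbb{H}_d^0,\ A \ne 0} \frac{\langle \Phi(A), \Omega_{\Phi(\rho)}^\kappa(\Phi(A))\rangle}{\langle A, \Omega_\rho^\kappa(A)\rangle} \quad (\le 1). \tag{19}$$

We write $\eta_{\max}^{\mathrm{Riem}}(\Phi)$ and $\eta_{\min}^{\mathrm{Riem}}(\Phi)$ for the contraction coefficients associated with the largest
$\kappa_{\max}(x) = (1+x)/2x$ and smallest $\kappa_{\min}(x) = 2/(1+x)$ functions in $\mathcal{K}$, while these need not
be the largest and smallest contraction coefficients for a given $\Phi$. (Indeed, Proposition 5.5
below suggests that this is not true in general.)

We write $D_\kappa(\rho,\gamma)$ for the geodesic distance with respect to $M^\kappa$, namely,

$$D_\kappa(\rho,\gamma) = \inf_\xi \int_0^1 \sqrt{\langle \xi'(t), \Omega_{\xi(t)}^\kappa(\xi'(t))\rangle}\,dt, \tag{20}$$

where the infimum is taken over all (piecewise) smooth curves $\xi$ joining $\rho, \gamma$ in $\mathcal{D}_d$. The
monotonicity of $M^\kappa$ obviously implies that

$$D_\kappa(\Phi(\rho),\Phi(\gamma)) \le D_\kappa(\rho,\gamma)$$

for every CPT map $\Phi : \mathbb{M}_d \to \mathbb{M}_{d'}$. The contraction coefficient of $\Phi$ with respect to the
geodesic distance is then defined by

$$\eta_\kappa^{\mathrm{geod}}(\Phi) = \left(\sup_{\rho,\gamma \in \mathcal{D}_d,\ \rho \ne \gamma} \frac{D_\kappa(\Phi(\rho),\Phi(\gamma))}{D_\kappa(\rho,\gamma)}\right)^2 \quad (\le 1). \tag{21}$$

We will prove the equality $\eta_\kappa^{\mathrm{Riem}}(\Phi) = \eta_\kappa^{\mathrm{geod}}(\Phi)$ in Section 5.1, while the inequality $\eta_\kappa^{\mathrm{Riem}}(\Phi) \ge
\eta_\kappa^{\mathrm{geod}}(\Phi)$ was shown in [38, Theorem IV.2].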

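Remark 2.3. The space $\mathbb{P}_d$ ($\supset \mathcal{D}_d$) is a smooth Riemannian manifold with the tangent space
$\mathbb{H}_d$ ($\supset \mathbb{H}_d^0$). For each $\kappa \in \mathcal{K}$ a Riemannian metric $M^\kappa$ on $\mathbb{P}_d$ is defined by the same expression
as (17) for $A, B \in \mathbb{H}_d$ and $\rho \in \mathbb{P}_d$. For any CPT map $\Phi$, if $A = \rho \in \mathbb{P}_d$ then we have

$$\frac{\langle \Phi(A), \Omega_{\Phi(\rho)}^\kappa(\Phi(A))\rangle}{\langle A, \Omega_\rho^\kappa(A)\rangle} = \frac{\mathrm{Tr}\,\Phi(\rho)}{\mathrm{Tr}\,\rho} = 1,$$

which says that the contraction coefficient of $\Phi$ with respect to $M^\kappa$ is meaningless if it is
defined with $\mathbb{H}_d$ instead of $\mathbb{H}_d^0$.

For any pair $\rho, \gamma \in \mathcal{D}_d$, in addition to (20) one can consider the geodesic distance $\widetilde{D}_\kappa(\rho,\gamma)$
with the same expression but taken over all smooth curves joining $\rho, \gamma$ in $\mathbb{P}_d$ without confining
them to $\mathcal{D}_d$. The difference between the manifolds $\mathcal{D}_d$ and $\mathbb{P}_d$ implies that $\widetilde{D}_\kappa(\rho,\gamma) \le
D_\kappa(\rho,\gamma)$. Moreover, for every $\kappa \in \mathcal{K}$, this inequality will be strict when $\rho\gamma = \gamma\rho$ and $\rho \ne \gamma$. In
this case $D_\kappa(\rho,\gamma)$ and $\widetilde{D}_\kappa(\rho,\gamma)$ are independent of the choice of $\kappa \in \mathcal{K}$ so that the formulas
(27) and (28) of the next section hold for any $\kappa \in \mathcal{K}$, as does the inequality (29). Indeed,
one can apply the monotonicity of $M^\kappa$ to the trace-preserving conditional expectation onto
a commutative subalgebra $\mathcal{A}$ containing $\rho, \gamma$ to see that curves $\xi$ can be confined, in the
definition (20), to those inside $\mathcal{A}$. (See also the proof of Lemma 5.4.)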


3 Examples 


Basic examples of $g$-divergences and monotone metrics are in order here. Further discussions
and results for these cases are given later in Section 7.

Example 1. (Relative entropy and BKM metric) The function $g(x) = x\log x \in \mathcal{G}$ gives the
(usual) relative entropy, i.e.,

$$H_{x\log x}(\rho,\gamma) = H(\rho,\gamma) = \mathrm{Tr}\,\rho(\log\rho - \log\gamma).$$

Moreover, $\tilde{g}(x) = -\log x$ and $H_{-\log x}(\rho,\gamma) = H(\gamma,\rho)$. The symmetrization

$$g_{\mathrm{BKM}}(x) = x\log x - \log x \in \mathcal{G}_{\mathrm{sym}}$$

corresponds to the function

$$\kappa_{\mathrm{BKM}}(x) = \frac{\log x}{x-1} \in \mathcal{K},$$

which gives

$$\Omega_\rho^{\mathrm{BKM}}(X) = \frac{\log L_\rho - \log R_\rho}{L_\rho - R_\rho}(X) = \int_0^\infty \frac{1}{\rho + tI}\,X\,\frac{1}{\rho + tI}\,dt. \tag{22}$$

The corresponding monotone metric is the so-called Bogoliubov (or Kubo-Mori) metric. We
write $\eta_{\mathrm{BKM}}^{\mathrm{Riem}}(\Phi)$ and $\eta_{\mathrm{BKM}}^{\mathrm{RelEnt}}(\Phi)$ for the contraction coefficients associated with $\kappa_{\mathrm{BKM}}$ and
$g_{\mathrm{BKM}}$. Rather surprisingly, it will be shown in Theorem 7.1 that $\eta_{\mathrm{BKM}}^{\mathrm{Riem}}(\Phi) = \eta_{\mathrm{BKM}}^{\mathrm{RelEnt}}(\Phi)$ holds
for every CPT map $\Phi$. However, we know from Theorem 6.6 that this property does not hold
in general.

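Example 2. (Maximal metric) The function $g(x) = (x-1)^2 \in \mathcal{G}$ yields the quadratic relative
entropy

$$H_{(x-1)^2}(\rho,\gamma) = \mathrm{Tr}\,(\rho-\gamma)^2\gamma^{-1} = \mathrm{Tr}\,\rho^2\gamma^{-1} - 1. \tag{23}$$

The function $g_{\max}(x) = (x-1)^2(1+x)/2x$ in $\mathcal{G}_{\mathrm{sym}}$ is the symmetrization of $g$, which corresponds
to the largest function $\kappa_{\max}$ in $\mathcal{K}$. The function $\kappa_{\max}$ defines the largest monotone
metric with

$$\Omega_\rho^{\max} = \frac{L_\rho^{-1} + R_\rho^{-1}}{2}. \tag{24}$$

Note that for the choice $X = \rho - \gamma$,

$$\langle \rho-\gamma, \Omega_\rho^{\max}(\rho-\gamma)\rangle = H_{(x-1)^2}(\gamma,\rho).$$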
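Example 3. (Central power metric) The function $\kappa(x) = x^{-1/2} \in \mathcal{K}$ gives

$$\Omega_\rho^{x^{-1/2}}(X) = \rho^{-1/2}X\rho^{-1/2},$$

which may be considered as the center of $\mathcal{K}$ from some aspects. For instance, it is known
[22, Theorem 3.5] that $x^{-1/2}$ is the only function $\kappa \in \mathcal{K}$ such that both $\Omega_\rho^\kappa$ and $(\Omega_\rho^\kappa)^{-1}$ are
CP for all $\rho \in \mathcal{D}_d$. The function in $\mathcal{G}_{\mathrm{sym}}$ corresponding to $x^{-1/2}$ is $(x-1)^2x^{-1/2}$ and the
corresponding divergence is

$$H_{(x-1)^2x^{-1/2}}(\rho,\gamma) = \mathrm{Tr}\,(\rho-\gamma)\rho^{-1/2}(\rho-\gamma)\gamma^{-1/2}.$$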
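Example 4. (Wigner-Yanase-Dyson metric) For any $t \in (0,1) \cup (1,2]$ define

$$g_t(x) = \frac{x - x^t}{t(1-t)},$$

whose symmetrized function corresponds to

$$\kappa_t^{\mathrm{WYD}}(x) = \frac{1}{t(1-t)}\,\frac{(1-x^t)(1-x^{1-t})}{(1-x)^2} \in \mathcal{K}. \tag{26}$$

Note that the functions $\kappa_t^{\mathrm{WYD}}$ extend to the parameter $t \in [-1,2]$ with $\kappa_t^{\mathrm{WYD}} = \kappa_{\mathrm{BKM}}$ for
$t = 0, 1$ by taking the limit as $t \to 0, 1$, and with $\kappa_t^{\mathrm{WYD}}$ symmetric around $t = 1/2$.
For the particular case $t = 1/2$ the monotone metric associated with $\kappa_{\mathrm{WY}} = \kappa_{1/2}^{\mathrm{WYD}}$ is called
the Wigner-Yanase metric with

$$\kappa_{\mathrm{WY}}(x) = \frac{4}{(1+\sqrt{x})^2}, \qquad \Omega_\rho^{\mathrm{WY}} = \frac{4}{(\sqrt{L_\rho} + \sqrt{R_\rho})^2},$$

and the divergence for $g_{1/2}(x) = 4(x - \sqrt{x})$ is

$$H_{4(x-\sqrt{x})}(\rho,\gamma) = 4\left(1 - \mathrm{Tr}\,\rho^{1/2}\gamma^{1/2}\right).$$

It seems that Hasegawa [18] was the first to realize that the WYD metric as well as the WYD
divergences could be extended to the full parameter range $[-1,2]$. See also [2S] where equality
conditions were given for the convexity of $g$-divergences for the WYD functions.

It is known [mi Theorem 5.4] that for every $\rho, \gamma \in \mathcal{D}_d$ the geodesic distance $D_{\mathrm{WY}}(\rho,\gamma)$
with respect to the metric for $\kappa_{\mathrm{WY}}$ is given as

$$D_{\mathrm{WY}}(\rho,\gamma) = \arccos \mathrm{Tr}\,\rho^{1/2}\gamma^{1/2}. \tag{27}$$

On the other hand, the geodesic distance $\widetilde{D}_{\mathrm{WY}}(\rho,\gamma)$ taken over curves in $\mathbb{P}_d$ (see Remark
2.3) is included in [24, Theorem 2.1] and we have

$$\widetilde{D}_{\mathrm{WY}}(\rho,\gamma) = \|\rho^{1/2} - \gamma^{1/2}\|_2 = \sqrt{2 - 2\,\mathrm{Tr}\,\rho^{1/2}\gamma^{1/2}}. \tag{28}$$

Since $\sqrt{2-2t} < \arccos t$ for $0 \le t < 1$, we see that

$$\widetilde{D}_{\mathrm{WY}}(\rho,\gamma) < D_{\mathrm{WY}}(\rho,\gamma) \tag{29}$$

unless $\rho = \gamma$.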
Example 5. (Minimal or Bures metric) The smallest function $\kappa_{\min}(x) = 2/(1+x)$ in $\mathcal{K}$
defines the smallest monotone metric with

$$\Omega_\rho^{\min}(X) = \frac{2}{L_\rho + R_\rho}(X) = 2\int_0^\infty e^{-t\rho}Xe^{-t\rho}\,dt, \tag{30}$$

which is often called the SLD metric (symmetric logarithmic derivative). This is considered
as the infinitesimal form of the Bures distance introduced in [7] and was intensively studied by
Uhlmann, e.g., [52], so it is also called the Bures or Bures-Uhlmann metric. The corresponding
function in $\mathcal{G}_{\mathrm{sym}}$ is $g_{\min}(x) = 2(x-1)^2/(x+1)$ and the corresponding divergence is

$$H_{\min}(\rho,\gamma) = \left\langle \rho-\gamma, \frac{2}{L_\rho + R_\gamma}(\rho-\gamma)\right\rangle.$$

Recall that the Bures distance [7] between $\rho, \gamma \in \overline{\mathcal{D}}_d$ is

$$d_{\mathrm{Bures}}(\rho,\gamma) = \sqrt{2 - 2F(\rho,\gamma)}, \tag{31}$$

where $F(\rho,\gamma) = \mathrm{Tr}\,(\rho^{1/2}\gamma\rho^{1/2})^{1/2}$ is the fidelity of $\rho, \gamma$. It is known (see [53] and [m (9.32)])
that the geodesic distance between $\rho, \gamma \in \mathcal{D}_d$ with respect to the Bures metric is given as

$$D_{\min}(\rho,\gamma) = \arccos F(\rho,\gamma). \tag{32}$$

Since $\mathrm{Tr}\,\rho^{1/2}\gamma^{1/2} < F(\rho,\gamma)$ unless $\rho\gamma = \gamma\rho$ (see [15, Corollary 3.4] and [20, Theorem 2.1]),
by comparing (27) and (32) one can see that $D_{\mathrm{WY}}(\rho,\gamma) > D_{\min}(\rho,\gamma)$ whenever $\rho\gamma \ne \gamma\rho$.
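
The comparison between $\mathrm{Tr}\,\rho^{1/2}\gamma^{1/2}$ and the fidelity can be observed numerically. The sketch below uses the standard identity $F(\rho,\gamma) = \|\rho^{1/2}\gamma^{1/2}\|_1$ (the sum of the singular values of $\rho^{1/2}\gamma^{1/2}$), which is equivalent to the definition after (31). The helper name `affinity` for $\mathrm{Tr}\,\rho^{1/2}\gamma^{1/2}$ and the sample states are illustrative choices, not notation from the paper.

```python
import numpy as np

def psd_sqrt(M):
    """Square root of a positive semi-definite matrix via eigendecomposition."""
    w, U = np.linalg.eigh(M)
    return (U * np.sqrt(np.clip(w, 0.0, None))) @ U.conj().T

def fidelity(rho, gamma):
    """F(rho, gamma) = Tr (rho^{1/2} gamma rho^{1/2})^{1/2} = || rho^{1/2} gamma^{1/2} ||_1."""
    product = psd_sqrt(rho) @ psd_sqrt(gamma)
    return float(np.linalg.svd(product, compute_uv=False).sum())

def affinity(rho, gamma):
    """Tr rho^{1/2} gamma^{1/2}, the quantity appearing in (27) and (28)."""
    return float(np.trace(psd_sqrt(rho) @ psd_sqrt(gamma)).real)

rho = np.diag([0.7, 0.3])
gamma = np.array([[0.5, 0.4], [0.4, 0.5]])     # does not commute with rho
gamma_diag = np.diag([0.2, 0.8])               # commutes with rho

f_noncomm, a_noncomm = fidelity(rho, gamma), affinity(rho, gamma)
f_comm, a_comm = fidelity(rho, gamma_diag), affinity(rho, gamma_diag)
```

For the non-commuting pair one expects `a_noncomm < f_noncomm`, while for the commuting pair the two quantities coincide, matching the statement above.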

4 Trace Distance and Eigenvalue Formulation 

4.1 Trace distance 

The most widely used distance for density matrices is the trace-norm distance $\|\rho - \gamma\|_1 =
\mathrm{Tr}\,|\rho - \gamma|$. The next trace-norm monotonicity property is a slight extension of [48, Theorem
1]. We give a proof for completeness.

Proposition 4.1. Let $\Phi : \mathbb{M}_d \to \mathbb{M}_{d'}$ be a positive (not necessarily CP) trace-preserving
map. Then

$$\|\Phi(X)\|_1 \le \|X\|_1, \qquad X \in \mathbb{M}_d. \tag{33}$$

Proof. Note that the adjoint map $\widehat{\Phi} : \mathbb{M}_{d'} \to \mathbb{M}_d$ is positive and unital. Due to the Russo-Dye
theorem (see, e.g., [6, Theorem 2.3.7]) we have

$$\|\widehat{\Phi}(Z)\|_\infty \le \|Z\|_\infty, \qquad Z \in \mathbb{M}_{d'},$$

where $\|\cdot\|_\infty$ denotes the operator norm. Since $\|\cdot\|_1$ is the dual norm of $\|\cdot\|_\infty$, we have the
asserted inequality as follows:

$$\|\Phi(X)\|_1 = \sup\{|\langle X, \widehat{\Phi}(Z)\rangle| : Z \in \mathbb{M}_{d'},\ \|Z\|_\infty \le 1\} \le \sup\{|\langle X, Y\rangle| : Y \in \mathbb{M}_d,\ \|Y\|_\infty \le 1\} = \|X\|_1$$

for every $X \in \mathbb{M}_d$. QED
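
Inequality (33) can be spot-checked on randomly generated channels. The sketch below builds a CPT map from random Kraus operators, normalized so that $\sum_i K_i^* K_i = I$, and compares trace norms before and after the map. This covers only the completely positive case, which is a special case of the positive maps in Proposition 4.1; the helper names and the random construction are assumptions made for illustration.

```python
import numpy as np

def random_kraus(d_in, d_out, n, rng):
    """Random Kraus operators K_1..K_n with sum_i K_i^* K_i = I (needs n * d_out >= d_in)."""
    K = rng.normal(size=(n, d_out, d_in)) + 1j * rng.normal(size=(n, d_out, d_in))
    S = sum(k.conj().T @ k for k in K)             # positive definite almost surely
    w, U = np.linalg.eigh(S)
    S_inv_sqrt = (U / np.sqrt(w)) @ U.conj().T
    return [k @ S_inv_sqrt for k in K]

def apply_channel(kraus, X):
    """Phi(X) = sum_i K_i X K_i^*."""
    return sum(k @ X @ k.conj().T for k in kraus)

def trace_norm(X):
    """||X||_1 as the sum of singular values."""
    return float(np.linalg.svd(X, compute_uv=False).sum())

rng = np.random.default_rng(0)
kraus = random_kraus(d_in=3, d_out=2, n=3, rng=rng)
samples = [rng.normal(size=(3, 3)) + 1j * rng.normal(size=(3, 3)) for _ in range(20)]
ratios = [trace_norm(apply_channel(kraus, X)) / trace_norm(X) for X in samples]
```

Every entry of `ratios` should be at most 1, including for the non-Hermitian inputs used here, since (33) is stated for all $X \in \mathbb{M}_d$.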
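The contraction coefficient of a positive trace-preserving map $\Phi$ with respect to the
trace-norm distance is defined by

$$\eta^{\mathrm{Dobrushin}}(\Phi) = \eta^{\mathrm{Tr}}(\Phi) = \sup_{\rho,\gamma \in \overline{\mathcal{D}}_d,\ \rho \ne \gamma} \frac{\|\Phi(\rho) - \Phi(\gamma)\|_1}{\|\rho - \gamma\|_1} = \sup_{A \in \mathbb{H}_d^0,\ A \ne 0} \frac{\|\Phi(A)\|_1}{\|A\|_1} \quad (\le 1), \tag{34}$$

which is the quantum generalization of the classical Dobrushin coefficient of ergodicity. It
was shown in [48, Theorem 2] that

$$\eta^{\mathrm{Tr}}(\Phi) = \frac{1}{2}\sup\{\|\Phi(E - F)\|_1 : E, F \in \mathbb{M}_d,\ \text{rank 1 projections},\ E \perp F\}. \tag{35}$$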

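Next we extend the notion of scrambling column-stochastic matrices in m to the matrix
algebra setting.

Proposition 4.2. Let $\Phi : \mathbb{M}_d \to \mathbb{M}_{d'}$ be a positive trace-preserving map. Then the following
conditions are equivalent:

(i) $\eta^{\mathrm{Tr}}(\Phi) < 1$;

(ii) for every rank 1 projections $E, F \in \mathbb{M}_d$ with $E \perp F$, $\mathrm{Tr}\,\Phi(E)\Phi(F) > 0$;

(iii) for every non-zero $A, B \in \overline{\mathbb{P}}_d$, $\mathrm{Tr}\,\Phi(A)\Phi(B) > 0$.

Proof. We denote by $\mathcal{S}$ the set of pairs $(E,F)$ of rank 1 projections in $\mathbb{M}_d$ with $E \perp F$.
Since $\mathcal{S}$ is compact in $\mathbb{M}_d \times \mathbb{M}_d$, there is an $(E_0,F_0) \in \mathcal{S}$ such that $\|\Phi(E_0) - \Phi(F_0)\|_1 =
\sup_{(E,F) \in \mathcal{S}} \|\Phi(E) - \Phi(F)\|_1$. Here, note that, for any $\rho, \gamma \in \overline{\mathcal{D}}_d$, $\|\rho - \gamma\|_1 = 2$ if and only
if the support projections of $\rho, \gamma$ are orthogonal, if and only if $\mathrm{Tr}\,\rho\gamma = 0$. Hence (i) $\Leftrightarrow$ (ii)
is immediate from (35). That (iii) $\Rightarrow$ (ii) is trivial. To prove that (i) $\Rightarrow$ (iii), assume (i)
and let $A, B \in \overline{\mathbb{P}}_d$ be non-zero. Set $\rho = A/\mathrm{Tr}\,A$ and $\gamma = B/\mathrm{Tr}\,B$; then $\rho, \gamma \in \overline{\mathcal{D}}_d$ and so
$\Phi(\rho), \Phi(\gamma) \in \overline{\mathcal{D}}_{d'}$. If $\rho = \gamma$, then $\mathrm{Tr}\,\Phi(A)\Phi(B) > 0$ is clear. If $\rho \ne \gamma$, then assumption
(i) implies that $\|\Phi(\rho) - \Phi(\gamma)\|_1 < \|\rho - \gamma\|_1 \le 2$. Hence we have $\mathrm{Tr}\,\Phi(\rho)\Phi(\gamma) > 0$, i.e.,
$\mathrm{Tr}\,\Phi(A)\Phi(B) > 0$. QED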


4.2 Eigenvalue formulation 


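We here summarize, for the convenience of the reader, an observation in [38] that links the
Riemannian metric coefficient with an eigenvalue problem, which is of some interest in its
own right. Moreover, it allows us to connect $\eta^{\mathrm{Tr}}(\Phi)$ with $\eta_\kappa^{\mathrm{Riem}}(\Phi)$ in some very special cases,
as in Theorem 4.4 below.

Let $\rho \in \mathcal{D}_d$ and $\Phi : \mathbb{M}_d \to \mathbb{M}_{d'}$ be a CPT map. Define a linear map $\Upsilon : \mathbb{M}_d \to \mathbb{M}_{d'}$ by

$$\Upsilon = (\Omega_{\Phi(\rho)}^\kappa)^{1/2}\,\Phi\,(\Omega_\rho^\kappa)^{-1/2}.$$

Since $\Upsilon$ is a contraction and $\Upsilon^*\Upsilon(\Omega_\rho^\kappa)^{-1/2}(I_d) = (\Omega_\rho^\kappa)^{-1/2}(I_d)$, note that $(\Omega_\rho^\kappa)^{-1/2}(I_d)$ is an
eigenvector of $\Upsilon^*\Upsilon$ corresponding to the largest eigenvalue 1. Let $\lambda_2^\kappa(\Phi,\rho)$ denote the second
largest eigenvalue (with multiplicities counted) of $\Upsilon^*\Upsilon$. Then $\lambda_2^\kappa(\Phi,\rho)$ is represented as

$$\lambda_2^\kappa(\Phi,\rho) = \sup_{X \in \mathbb{M}_d^0,\ X \ne 0} \frac{\langle \Phi(X), \Omega_{\Phi(\rho)}^\kappa(\Phi(X))\rangle}{\langle X, \Omega_\rho^\kappa(X)\rangle}, \tag{36}$$

where $\mathbb{M}_d^0 = \{X \in \mathbb{M}_d : \mathrm{Tr}\,X = 0\}$. From the fact that $\Omega_\rho^\kappa(A) \in \mathbb{H}_d$ and $\langle A, \Omega_\rho^\kappa(B)\rangle =
\langle B, \Omega_\rho^\kappa(A)\rangle$ for all $A, B \in \mathbb{H}_d$, one can easily see that the right-hand side of (36) coincides
with the same supremum restricted to $A \in \mathbb{H}_d^0$, $A \ne 0$. Therefore, we have

Theorem 4.3. [38, Theorem IV.4] For every $\kappa \in \mathcal{K}$ and every CPT map $\Phi$,

$$\eta_\kappa^{\mathrm{Riem}}(\Phi) = \sup_{\rho \in \mathcal{D}_d} \lambda_2^\kappa(\Phi,\rho).$$

Now assume that an eigenvector of $\Upsilon^*\Upsilon$ corresponding to $\lambda_2 = \lambda_2^\kappa(\Phi,\rho)$ is given by
$(\Omega_\rho^\kappa)^{1/2}(X)$, orthogonal to $(\Omega_\rho^\kappa)^{-1/2}(I_d)$. Then $\mathrm{Tr}\,X = 0$ and $\Upsilon^*\Upsilon(\Omega_\rho^\kappa)^{1/2}(X) = \lambda_2(\Omega_\rho^\kappa)^{1/2}(X)$,
which is equivalently written as

$$(\Omega_\rho^\kappa)^{-1}\,\widehat{\Phi}\,\Omega_{\Phi(\rho)}^\kappa(\Phi(X)) = \lambda_2 X. \tag{37}$$

Thus, finding $\lambda_2^\kappa(\Phi,\rho)$ is equivalent to solving the eigenvalue problem (37) under the
constraint $\mathrm{Tr}\,X = 0$. In connection with (37) we define a linear map $\Psi : \mathbb{M}_{d'} \to \mathbb{M}_d$ by

$$\Psi = (\Omega_\rho^\kappa)^{-1}\,\widehat{\Phi}\,\Omega_{\Phi(\rho)}^\kappa.$$

Since $\widehat{\Psi}(I_d) = I_{d'}$, $\Psi$ is trace-preserving. Here, assume that $\Psi$ is positive in the sense that
$\Psi(\overline{\mathbb{P}}_{d'}) \subset \overline{\mathbb{P}}_d$. Then, thanks to (33), $\|\Psi(Z)\|_1 \le \|Z\|_1$ for all $Z \in \mathbb{M}_{d'}$. Therefore, if $X \in \mathbb{M}_d$ is
a solution of the eigenvalue equation (37), then we have $\lambda_2\|X\|_1 = \|\Psi(\Phi(X))\|_1 \le \|\Phi(X)\|_1$
so that $\lambda_2 \le \|\Phi(X)\|_1/\|X\|_1$. Since all linear maps involved in (37) preserve self-adjointness, note
that a solution $X$ of (37) with $\mathrm{Tr}\,X = 0$ can always be taken in $\mathbb{H}_d^0$.

From the above argument, if $\Psi$ is positive for every $\rho \in \mathcal{D}_d$ then we would have $\eta_\kappa^{\mathrm{Riem}}(\Phi) \le
\eta^{\mathrm{Tr}}(\Phi)$. This situation indeed occurs, in particular, when both $\Omega_\rho^\kappa$ and $(\Omega_\rho^\kappa)^{-1}$ are positive
(equivalently, CP) for every $\rho \in \mathcal{D}_d$ and every $d \in \mathbb{N}$. But it is known [22, Proposition 3.5]
that this latter condition holds only when $\kappa(x) = x^{-1/2}$. So we have

Theorem 4.4. [50, Theorem 14] For every CPT map $\Phi$,

$$\eta_{x^{-1/2}}^{\mathrm{Riem}}(\Phi) \le \eta^{\mathrm{Tr}}(\Phi).$$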

5 General Contraction Results 

5.1 Results for arbitrary channels 

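In this subsection we present a few general relations between the contraction coefficients
defined in Section 2. The next theorem establishes the general equality between the Riemannian
metric contraction coefficient and the geodesic contraction coefficient. The proof is based on
a limit formula in [24] for the geodesic distance, whose proof is presented in Appendix A for
completeness.

Theorem 5.1. For every $\kappa \in \mathcal{K}$ and every CPT map $\Phi : \mathbb{M}_d \to \mathbb{M}_{d'}$,

$$\eta_\kappa^{\mathrm{Riem}}(\Phi) = \eta_\kappa^{\mathrm{geod}}(\Phi).$$

Proof. The inequality $\eta_\kappa^{\mathrm{Riem}}(\Phi) \ge \eta_\kappa^{\mathrm{geod}}(\Phi)$ was shown in [38, Theorem IV.2]. To prove the
reverse inequality, we use Lemma A.1 in the appendix. For every $\rho \in \mathcal{D}_d$ and every $A \in \mathbb{H}_d^0$
with $A \ne 0$, by the lemma we have

$$\frac{\langle \Phi(A), \Omega_{\Phi(\rho)}^\kappa(\Phi(A))\rangle}{\langle A, \Omega_\rho^\kappa(A)\rangle} = \lim_{\varepsilon \searrow 0} \left(\frac{D_\kappa(\Phi(\rho),\Phi(\rho + \varepsilon A))}{D_\kappa(\rho,\rho + \varepsilon A)}\right)^2,$$

which implies that $\eta_\kappa^{\mathrm{Riem}}(\Phi) \le \eta_\kappa^{\mathrm{geod}}(\Phi)$. QED

For completeness we state the following theorem. The first inequality was proved in [38],
and the rest is a straightforward consequence of (14) as discussed earlier.

Theorem 5.2. [38, Theorem IV.2] For every $g \in \mathcal{G}$ let $g_{\mathrm{sym}}$ be the symmetrization of $g$ as
in (6) and $\kappa(x) = g_{\mathrm{sym}}(x)/(x-1)^2 \in \mathcal{K}$, as in (5). Then, for every CPT map $\Phi : \mathbb{M}_d \to \mathbb{M}_{d'}$,

$$\eta_\kappa^{\mathrm{Riem}}(\Phi) \le \eta_{g_{\mathrm{sym}}}^{\mathrm{RelEnt}}(\Phi) \le \eta_g^{\mathrm{RelEnt}}(\Phi).$$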

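In the next theorem we give a general inequality between the contraction coefficients for
Riemannian metrics and the trace-norm, generalizing [48, Theorem 3] and [50, Theorem 13].

Theorem 5.3. For every $\kappa \in \mathcal{K}$ and every CPT map $\Phi : \mathbb{M}_d \to \mathbb{M}_{d'}$,

$$\eta^{\mathrm{Tr}}(\Phi) \le \sqrt{\eta_\kappa^{\mathrm{Riem}}(\Phi)}.$$

To prove this result, we first give a lemma generalizing m Lemma 5].

Lemma 5.4. For every $\kappa \in \mathcal{K}$ and every $\rho \in \mathcal{D}_d$,

$$\|A\|_1^2 \le \langle A, \Omega_\rho^\kappa(A)\rangle, \qquad A \in \mathbb{H}_d.$$

Proof. Let $\mathcal{E}$ be the trace-preserving conditional expectation from $\mathbb{M}_d$ onto the subalgebra
generated by $A$. The monotonicity of $M^\kappa$ implies that

$$\langle A, \Omega_\rho^\kappa(A)\rangle \ge \langle A, \Omega_{\mathcal{E}(\rho)}^\kappa(A)\rangle = \mathrm{Tr}\,\mathcal{E}(\rho)^{-1}A^2,$$

where the latter equality follows since $\mathcal{E}(\rho)$ and $A$ commute. By the Schwarz inequality we
have

$$\|A\|_1^2 = (\mathrm{Tr}\,|A|)^2 = \left(\mathrm{Tr}\,\mathcal{E}(\rho)^{1/2} \cdot \mathcal{E}(\rho)^{-1/2}|A|\right)^2 \le \mathrm{Tr}\,\mathcal{E}(\rho) \cdot \mathrm{Tr}\,\mathcal{E}(\rho)^{-1}A^2 = \mathrm{Tr}\,\mathcal{E}(\rho)^{-1}A^2.$$

Therefore, $\|A\|_1^2 \le \langle A, \Omega_\rho^\kappa(A)\rangle$. QED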
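
Lemma 5.4 lends itself to a quick numerical sanity check. The sketch below evaluates the gap $\langle A, \Omega_\rho^\kappa(A)\rangle - \|A\|_1^2$ for random density matrices and Hermitian $A$, using the entrywise form of $\Omega_\rho^\kappa$ in an eigenbasis of $\rho$ (the $(i,j)$ entry is multiplied by $\kappa(\lambda_i/\lambda_j)/\lambda_j$). The function names, the random sampling, and the choice of testing $\kappa_{\min}$ (the smallest element of $\mathcal{K}$, hence the tightest case) are illustrative assumptions; the check does not replace the proof.

```python
import numpy as np

def omega(kappa, rho, X):
    """Omega_rho^kappa(X), computed entrywise in an eigenbasis of rho."""
    lam, U = np.linalg.eigh(rho)
    X_eig = U.conj().T @ X @ U
    weights = kappa(lam[:, None] / lam[None, :]) / lam[None, :]
    return U @ (weights * X_eig) @ U.conj().T

def trace_norm(A):
    return float(np.linalg.svd(A, compute_uv=False).sum())

def lemma_gap(kappa, rho, A):
    """<A, Omega_rho^kappa(A)> - ||A||_1^2; Lemma 5.4 asserts this is nonnegative."""
    quad = np.trace(A.conj().T @ omega(kappa, rho, A)).real
    return float(quad - trace_norm(A) ** 2)

def random_density_matrix(d, rng):
    G = rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d))
    rho = G @ G.conj().T + 0.05 * np.eye(d)     # shift keeps rho well conditioned
    return rho / np.trace(rho).real

def random_hermitian(d, rng):
    G = rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d))
    return (G + G.conj().T) / 2.0

kappa_min = lambda x: 2.0 / (1.0 + x)
rng = np.random.default_rng(1)
gaps = [lemma_gap(kappa_min, random_density_matrix(3, rng), random_hermitian(3, rng))
        for _ in range(50)]

# Equality case used in the proof of Theorem 5.3: rho = |A| / ||A||_1 with A invertible.
A_eq = np.diag([2.0, -1.0, -1.0])
rho_eq = np.abs(A_eq) / trace_norm(A_eq)
gap_eq = lemma_gap(kappa_min, rho_eq, A_eq)
```

All entries of `gaps` should be nonnegative up to rounding, and `gap_eq` should vanish, reflecting that the bound is attained when $\rho$ is proportional to $|A|$.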

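Proof of Theorem 5.3. Let $A \in \mathbb{H}_d^0$ with $A \ne 0$ and assume further that $A$ is invertible.
Set $\rho = |A|/\|A\|_1 \in \mathcal{D}_d$. By the above lemma we have

$$\|\Phi(A)\|_1^2 \le \langle \Phi(A), \Omega_{\Phi(\rho)}^\kappa(\Phi(A))\rangle.$$

On the other hand, since $\rho$ and $A$ commute, we have

$$\langle A, \Omega_\rho^\kappa(A)\rangle = \mathrm{Tr}\,\rho^{-1}A^2 = \|A\|_1\,\mathrm{Tr}\,|A| = \|A\|_1^2.$$

Therefore,

$$\frac{\|\Phi(A)\|_1^2}{\|A\|_1^2} \le \frac{\langle \Phi(A), \Omega_{\Phi(\rho)}^\kappa(\Phi(A))\rangle}{\langle A, \Omega_\rho^\kappa(A)\rangle} \le \eta_\kappa^{\mathrm{Riem}}(\Phi).$$

By continuity we have $\|\Phi(A)\|_1^2/\|A\|_1^2 \le \eta_\kappa^{\mathrm{Riem}}(\Phi)$ for all $A \in \mathbb{H}_d^0$ with $A \ne 0$, proving the
desired inequality. QED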
5.2 QC and CQ channels

Let $\Phi : \mathbb{M}_d \to \mathbb{M}_{d'}$ be a positive trace-preserving map. We call $\Phi$ a quantum-classical (QC)
channel if the range of $\Phi$ is included in a commutative subalgebra of $\mathbb{M}_{d'}$, and a
classical-quantum (CQ) channel if the range of $\widehat{\Phi}$ is in a commutative subalgebra of $\mathbb{M}_d$. Note that if
$\Phi$ is QC or CQ, then positivity is the same as CP, so it is indeed a channel. The following
facts are easy to see: $\Phi$ is a QC channel if and only if there are an orthonormal basis $\{\psi_k\}_{k=1}^{d'}$
of $\mathbb{C}^{d'}$ and a POVM $\{F_k\}_{k=1}^{d'}$ in $\mathbb{M}_d$ such that

$$\Phi(\rho) = \sum_k (\mathrm{Tr}\,F_k\rho)\,|\psi_k\rangle\langle\psi_k|.$$

Also, $\Phi$ is a CQ channel if and only if there are an orthonormal basis $\{\phi_k\}_{k=1}^{d}$ of $\mathbb{C}^d$ and
density matrices $\gamma_k \in \overline{\mathcal{D}}_{d'}$, $1 \le k \le d$, such that

$$\Phi(\rho) = \sum_k \langle\phi_k, \rho\,\phi_k\rangle\,\gamma_k.$$

Thus, our notions of QC and CQ channels coincide with those introduced in m. We note
that both QC and CQ channels have the form $\Phi(\rho) = \sum_k (\mathrm{Tr}\,F_k\rho)\,\gamma_k$ with a POVM $\{F_k\}$ in
$\mathbb{M}_d$ and $\gamma_k \in \overline{\mathcal{D}}_{d'}$, introduced in m and shown to be entanglement breaking. Several equivalent
characterizations of this class were given in [28].
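
To make the QC form concrete, the sketch below builds a qubit QC channel from a two-outcome POVM and writes the output in the standard basis, taken here as the orthonormal basis $\{\psi_k\}$. The particular POVM, the input state, and the function name `qc_channel` are illustrative assumptions.

```python
import numpy as np

def qc_channel(povm, rho):
    """Phi(rho) = sum_k Tr(F_k rho) |k><k|, with |k> the standard basis vectors."""
    probs = np.array([np.trace(F @ rho).real for F in povm])
    return np.diag(probs)

sigma_x = np.array([[0.0, 1.0], [1.0, 0.0]])
F0 = 0.5 * (np.eye(2) + 0.6 * sigma_x)       # unsharp measurement along the x axis
povm = [F0, np.eye(2) - F0]                  # elements are positive and sum to the identity

rho = np.array([[0.8, 0.1j], [-0.1j, 0.2]])  # a qubit density matrix
out = qc_channel(povm, rho)

# Orthogonal rank 1 projections onto the eigenvectors of sigma_x.
plus = np.array([[0.5, 0.5], [0.5, 0.5]])
minus = np.array([[0.5, -0.5], [-0.5, 0.5]])
half_distance = 0.5 * np.abs(np.diag(qc_channel(povm, plus) - qc_channel(povm, minus))).sum()
```

The output is always diagonal, so all outputs commute, which is the defining property of a QC channel. By (35), the value `half_distance` (0.6 for this POVM) is a lower bound on $\eta^{\mathrm{Tr}}(\Phi)$, since it uses one particular pair of orthogonal projections.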

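We remark that when $\Phi$ is purely classical, i.e., the ranges of $\Phi$ and $\widehat{\Phi}$ are in commutative
subalgebras of $\mathbb{M}_{d'}$ and $\mathbb{M}_d$ respectively, then $\Phi$ can be represented by a $d' \times d$
column-stochastic matrix in some orthonormal bases of $\mathbb{C}^d$ and $\mathbb{C}^{d'}$. Thus [H Theorem 1] implies
that

$$\eta_\kappa^{\mathrm{Riem}}(\Phi) = \eta_g^{\mathrm{RelEnt}}(\Phi)$$

for any independent choices of $\kappa \in \mathcal{K}$ and $g \in \mathcal{G}$.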

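Proposition 5.5. Let $\kappa_1, \kappa_2 \in \mathcal{K}$, and assume that $\kappa_1(x) \le \kappa_2(x)$ for all $x > 0$. Then
$\eta^{\rm Riem}_{\kappa_1}(\Phi) \ge \eta^{\rm Riem}_{\kappa_2}(\Phi)$ for every QC channel $\Phi$, and $\eta^{\rm Riem}_{\kappa_1}(\Phi) \le \eta^{\rm Riem}_{\kappa_2}(\Phi)$ for every CQ
channel $\Phi$.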


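Proof. The assumption $\kappa_1 \le \kappa_2$ implies that $\Omega^{\kappa_1}_\rho \le \Omega^{\kappa_2}_\rho$ as operators on the Hilbert space
$\mathbb{M}_d$ for every $\rho \in \mathcal{D}_d$. When $\Phi$ is QC, we have

$$\eta^{\rm Riem}_\kappa(\Phi) = \sup_{\rho\in\mathcal{D}_d}\ \sup_{A\in\mathbb{H}_d^0,\,A\ne0} \frac{\mathrm{Tr}\,\Phi(\rho)^{-1}\Phi(A)^2}{\langle A, \Omega^\kappa_\rho(A)\rangle}$$

for every $\kappa \in \mathcal{K}$. Hence $\eta^{\rm Riem}_{\kappa_1}(\Phi) \ge \eta^{\rm Riem}_{\kappa_2}(\Phi)$. When $\Phi$ is CQ, choose a subalgebra $\mathcal{A}$ of
$\mathbb{M}_d$ including the range of $\hat\Phi$ and let $\mathcal{E} : \mathbb{M}_d \to \mathcal{A}$ be the trace-preserving conditional expectation.
Since $\hat{\mathcal{E}}$ is nothing but the inclusion $\mathcal{A} \hookrightarrow \mathbb{M}_d$, we have $\hat\Phi = \hat{\mathcal{E}}\hat\Phi$ so that $\Phi = \Phi\mathcal{E}$. Therefore,
we have

$$\eta^{\rm Riem}_\kappa(\Phi) = \sup_{\rho\in\mathcal{D}_d\cap\mathcal{A}}\ \sup_{A\in\mathbb{H}_d^0\cap\mathcal{A},\,A\ne0} \frac{\langle \Phi(A), \Omega^\kappa_{\Phi(\rho)}(\Phi(A))\rangle}{\mathrm{Tr}\,\rho^{-1}A^2}$$

for every $\kappa \in \mathcal{K}$. Hence $\eta^{\rm Riem}_{\kappa_1}(\Phi) \le \eta^{\rm Riem}_{\kappa_2}(\Phi)$. QED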


Thanks to (11), by Proposition 5.5 and Theorem 4.4 we have

Corollary 5.6. If $\Phi$ is a QC channel and $\kappa \in \mathcal{K}$ satisfies $\kappa(x) \ge x^{-1/2}$ for all $x > 0$, then

$$\eta^{\rm Riem}_\kappa(\Phi) \le \eta^{\rm Tr}(\Phi).$$

If $\Phi$ is a CQ channel and $\kappa \in \mathcal{K}$ satisfies $\kappa(x) \le x^{-1/2}$ for all $x > 0$, then

$$\eta^{\rm Riem}_\kappa(\Phi) \le \eta^{\rm Tr}(\Phi).$$

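The next theorem shows a relation of $\eta^{\rm RelEnt}_g(\Phi)$ for any $g \in \mathcal{G}_{\rm sym}$ with the coefficient
$\eta^{\rm Riem}_{\min}(\Phi)$ with respect to the minimal metric when $\Phi$ is a QC channel. The proof is based on
the integral decomposition (8).

Theorem 5.7. Assume that $\Phi : \mathbb{M}_d \to \mathbb{M}_{d'}$ is a QC channel. Then for every $\kappa \in \mathcal{K}$ and the
corresponding $g \in \mathcal{G}_{\rm sym}$ in (5),

$$\eta^{\rm Riem}_\kappa(\Phi) \le \eta^{\rm RelEnt}_g(\Phi) \le \eta^{\rm Riem}_{\min}(\Phi),$$

and in particular,

$$\eta^{\rm Riem}_{\min}(\Phi) = \eta^{\rm RelEnt}_{g_{\min}}(\Phi) \tag{38}$$

for $g_{\min} \in \mathcal{G}_{\rm sym}$ given in Example 5.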


Proof. The first inequality holds for a general CPT map $\Phi$ due to Theorem 5.2. Now we
assume that $\Phi$ is a QC channel and let $\kappa \in \mathcal{K}$ be arbitrary. Since $\kappa$ admits an integral
expression (or the extremal decomposition) as in (8), we can write

$$\kappa(x) = \int_{[0,1]} \kappa_s(x)\,dm(s), \qquad x \in (0,\infty),$$
with Ks G /C, 0 < s < 1, given in (fT?1]l . Moreover, for 0 < s < 1 let 

5.(x) = G e, 


(39) 


whose symmetrization (5s)sym £ ^sym corresponds in ([5]) to Kg, i.e., (s's)sym(a^) = {x—lfiKs{x). 
For every /?, 7 G we then have 

Hgip,l)= {p,-f)dm{s). 

-'[ 0 . 1 ] 


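So it suffices to prove that

$$H_{(g_s)_{\rm sym}}(\Phi(\rho),\Phi(\gamma)) \le \eta^{\rm Riem}_{\min}(\Phi)\,H_{(g_s)_{\rm sym}}(\rho,\gamma), \qquad 0 \le s \le 1. \tag{40}$$

Since

$$R_\gamma^{-1}\kappa_s(L_\rho R_\gamma^{-1}) = \frac{1+s}{2}\,R_\gamma^{-1}\left(\frac{1}{L_\rho R_\gamma^{-1}+s} + \frac{1}{sL_\rho R_\gamma^{-1}+1}\right) = \frac{1+s}{2}\left(\frac{1}{L_\rho+sR_\gamma} + \frac{1}{sL_\rho+R_\gamma}\right), \tag{41}$$

we have

$$R_\gamma^{-1}\kappa_s(L_\rho R_\gamma^{-1}) + R_\rho^{-1}\kappa_s(L_\gamma R_\rho^{-1})$$
$$= (1+s)\left[\frac{(L_\rho+sR_\gamma)^{-1} + (sL_\gamma+R_\rho)^{-1}}{2} + \frac{(sL_\rho+R_\gamma)^{-1} + (L_\gamma+sR_\rho)^{-1}}{2}\right]$$
$$\ge (1+s)\left[\left(\frac{L_\rho+sR_\gamma+sL_\gamma+R_\rho}{2}\right)^{-1} + \left(\frac{sL_\rho+R_\gamma+L_\gamma+sR_\rho}{2}\right)^{-1}\right].$$

In the above the operator convexity of $x^{-1}$ on $(0,\infty)$ has been used. We then have

$$H_{(g_s)_{\rm sym}}(\rho,\gamma) = \frac{H_{(g_s)_{\rm sym}}(\rho,\gamma) + H_{(g_s)_{\rm sym}}(\gamma,\rho)}{2} = \frac12\Bigl\langle \rho-\gamma, \bigl[R_\gamma^{-1}\kappa_s(L_\rho R_\gamma^{-1}) + R_\rho^{-1}\kappa_s(L_\gamma R_\rho^{-1})\bigr](\rho-\gamma)\Bigr\rangle$$
$$\ge \frac{1+s}{2}\left\langle \rho-\gamma, \left[\left(\frac{L_{\rho+s\gamma}+R_{\rho+s\gamma}}{2}\right)^{-1} + \left(\frac{L_{\gamma+s\rho}+R_{\gamma+s\rho}}{2}\right)^{-1}\right](\rho-\gamma)\right\rangle$$
$$= \frac12\Bigl\langle \rho-\gamma, \Bigl\{\Omega^{\min}_{\frac{\rho+s\gamma}{1+s}} + \Omega^{\min}_{\frac{\gamma+s\rho}{1+s}}\Bigr\}(\rho-\gamma)\Bigr\rangle \tag{42}$$

thanks to (41).

Now we consider the case where $\rho$ and $\gamma$ commute. Since $\rho$ and $\gamma$ commute with $\rho-\gamma$, it
is easy to see from (41) that

$$H_{(g_s)_{\rm sym}}(\rho,\gamma) = \langle \rho-\gamma, R_\gamma^{-1}\kappa_s(L_\rho R_\gamma^{-1})(\rho-\gamma)\rangle$$
$$= \frac{1+s}{2}\bigl\langle \rho-\gamma, \{(\rho+s\gamma)^{-1} + (\gamma+s\rho)^{-1}\}(\rho-\gamma)\bigr\rangle$$
$$= \frac12\Bigl\langle \rho-\gamma, \Bigl\{\Omega^{\min}_{\frac{\rho+s\gamma}{1+s}} + \Omega^{\min}_{\frac{\gamma+s\rho}{1+s}}\Bigr\}(\rho-\gamma)\Bigr\rangle.$$

Since $\Phi$ has the commutative range, we can apply the above to $\Phi(\rho)$ and $\Phi(\gamma)$ to obtain

$$H_{(g_s)_{\rm sym}}(\Phi(\rho),\Phi(\gamma)) = \frac12\Bigl\langle \Phi(\rho-\gamma), \Bigl\{\Omega^{\min}_{\Phi\left(\frac{\rho+s\gamma}{1+s}\right)} + \Omega^{\min}_{\Phi\left(\frac{\gamma+s\rho}{1+s}\right)}\Bigr\}\Phi(\rho-\gamma)\Bigr\rangle$$
$$\le \frac{\eta^{\rm Riem}_{\min}(\Phi)}{2}\Bigl\langle \rho-\gamma, \Bigl\{\Omega^{\min}_{\frac{\rho+s\gamma}{1+s}} + \Omega^{\min}_{\frac{\gamma+s\rho}{1+s}}\Bigr\}(\rho-\gamma)\Bigr\rangle. \tag{43}$$

Hence (40) follows from (42) and (43). QED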


5.3 Weak Schwarz maps 

Although we have restricted our consideration to quantum channels in the usual sense of CPT
maps, the monotonicity property of $g$-divergences and monotone metrics holds more generally
under a positive trace-preserving map $\Phi : \mathbb{M}_d \to \mathbb{M}_{d'}$ whose adjoint $\hat\Phi$ is a weak Schwarz map
in the sense that $\hat\Phi(Y^*)\hat\Phi(Y) \le \hat\Phi(Y^*Y)$ for all $Y \in \mathbb{M}_{d'}$. The proofs of monotonicity of
$g$-divergences in [44, 45] as well as the argument in [4?] for monotone metrics require only this
weaker condition, as discussed further in [23]. Thus, we can define the contraction coefficients
for such maps $\Phi$ rather than CPT maps, and most results in the paper extend to this slightly
more general situation.

We introduce the term “weak Schwarz map” for the following reason. For a positive linear
functional $\varphi$ on an operator algebra, the Schwarz inequality can be written as $|\varphi(A^*B)|^2 \le
\varphi(A^*A)\varphi(B^*B)$. The analogous result for a linear map $\Phi$ on an operator algebra is the
operator inequality

$$\Phi(A^*B)[\Phi(B^*B)]^{-1}\Phi(B^*A) \le \Phi(A^*A) \tag{44}$$

first proved for CP maps by Lieb and Ruskai [40] in 1974. (In finite dimensions, an equivalent
inequality was proved much earlier by Kiefer [33] in 1959.) In 1980 Choi [8] showed that (44)
holds if and only if $\Phi$ is 2-positive. Thus, it would be natural to consider the Schwarz maps as
precisely the class of 2-positive maps. However, earlier in 1952 Kadison [?] proved a special
case of (44) with $A = A^*$ and $B = I$, and it was later found that the condition $A = A^*$ could
be dropped in many situations. Thus, the terms “Schwarz inequality” and “Schwarz map”
were associated with the weaker inequality $\Phi(A^*)\Phi(A) \le \Phi(A^*A)$. However, this inequality
does not hold for arbitrary positive linear maps and we know of no characterization of the
subclass for which it holds other than the inequality itself.


6 Qubit Channels 


We now consider some special qubit channels. Some of these results were stated without 
proof at the end of [38]. Others are new and resolve conjectures discussed elsewhere in the 
paper [38] . 


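We first recall the description of $\mathcal{D}_2$ as the Bloch ball briefly, see, e.g., [43, 47, 49] for more
details. The Pauli matrices

$$\sigma_1 = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \quad \sigma_2 = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix} \quad\text{and}\quad \sigma_3 = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} \quad\text{with}\quad I = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}$$

form an orthogonal basis of the qubit Hilbert space $\mathbb{M}_2$. Any $\rho \in \mathcal{D}_2$ is represented as
$\rho = \frac12[I + \mathbf{w}\cdot\sigma]$ by a unique $\mathbf{w} = (w_1,w_2,w_3)^t \in \mathbb{R}^3$ with $|\mathbf{w}| = \sqrt{w_1^2+w_2^2+w_3^2} \le 1$, where
$\mathbf{w}\cdot\sigma = w_1\sigma_1 + w_2\sigma_2 + w_3\sigma_3$. Here, $\rho$ is pure if and only if $\mathbf{w}$ is on the unit sphere, i.e., $|\mathbf{w}| = 1$.
A trace-preserving linear map $\Phi : \mathbb{M}_2 \to \mathbb{M}_2$ is represented as

$$\Phi(w_0I + \mathbf{w}\cdot\sigma) = w_0I + (w_0\mathbf{t} + T\mathbf{w})\cdot\sigma, \qquad w_0 \in \mathbb{R},\ \mathbf{w} \in \mathbb{R}^3,$$

by a vector $\mathbf{t} \in \mathbb{R}^3$ and a $3 \times 3$ real matrix $T$ which, as observed in [?], can
be assumed to be diagonal without loss of generality. Clearly, $\Phi$ is positive if and only if
$|\mathbf{t} + T\mathbf{w}| \le 1$ for all $\mathbf{w} \in \mathbb{R}^3$ with $|\mathbf{w}| \le 1$, and $\Phi$ is unital and positive if and only if $\mathbf{t} = 0$
and $\|T\|_\infty \le 1$, where $\|T\|_\infty$ is the operator norm of $T$. Necessary and sufficient conditions
for complete positivity were given in [49]. In the special case when only $t_3 \ne 0$ it was shown
earlier by Fujiwara and Algoet [?] that a map of this form is CPT if and only if

$$(\lambda_1 \pm \lambda_2)^2 \le (1 \pm \lambda_3)^2 - t_3^2,$$

where $\lambda_k$ are the diagonal elements of $T$.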
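The Fujiwara and Algoet condition is easy to evaluate for concrete parameters. The following sketch assumes the condition in the form $(\lambda_1 \pm \lambda_2)^2 \le (1 \pm \lambda_3)^2 - t_3^2$ for $T = \mathrm{diag}(\lambda_1,\lambda_2,\lambda_3)$ and $\mathbf{t} = (0,0,t_3)^t$. The function name and the tolerance argument are illustrative additions.

```python
def is_cpt_fujiwara_algoet(lam, t3, tol=1e-12):
    """Check (l1 +/- l2)^2 <= (1 +/- l3)^2 - t3^2 for T = diag(lam), t = (0, 0, t3).

    `tol` absorbs floating-point rounding on the boundary of the region.
    """
    l1, l2, l3 = lam
    plus = (l1 + l2) ** 2 <= (1 + l3) ** 2 - t3 ** 2 + tol
    minus = (l1 - l2) ** 2 <= (1 - l3) ** 2 - t3 ** 2 + tol
    return plus and minus


# T = diag(alpha, 0, 0): the condition reduces to alpha^2 + t3^2 <= 1.
inside = is_cpt_fujiwara_algoet((0.5, 0.0, 0.0), 0.5)
outside = is_cpt_fujiwara_algoet((0.9, 0.0, 0.0), 0.9)
identity = is_cpt_fujiwara_algoet((1.0, 1.0, 1.0), 0.0)
```

For $T = \mathrm{diag}(\alpha,0,0)$ both inequalities collapse to $\alpha^2 + t_3^2 \le 1$.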

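Theorem 6.1. For any unital map $\Phi_T : I + \mathbf{w}\cdot\sigma \mapsto I + (T\mathbf{w})\cdot\sigma$ where $T$ is a real matrix
with $\|T\|_\infty \le 1$,

$$\eta^{\rm Riem}_\kappa(\Phi_T) = \eta^{\rm RelEnt}_g(\Phi_T) = \|T\|_\infty^2$$

for every $\kappa \in \mathcal{K}$ and every $g \in \mathcal{G}$. Furthermore, $\eta^{\rm Tr}(\Phi_T) = \|T\|_\infty$.

This result does not require that the map $\Phi_T$ be CP. Note that $\eta^{\rm Tr}(\Phi_T) = \|T\|_\infty =
\sqrt{\eta^{\rm Riem}_\kappa(\Phi_T)}$, which is consistent with Theorem 5.3.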

The next theorem treats a family of trace-preserving maps $\Phi_{\alpha,\tau} : \mathbb{M}_2 \to \mathbb{M}_2$ with two real
parameters $\alpha, \tau$ determined by $\mathbf{t} = (0,0,\tau)^t$ and $T = \mathrm{diag}(\alpha,0,0)$; more explicitly,

$$\Phi_{\alpha,\tau}(w_0I + \mathbf{w}\cdot\sigma) = w_0I + \alpha w_1\sigma_1 + \tau w_0\sigma_3.$$

In this case, the condition $\alpha^2 + \tau^2 \le 1$ is necessary and sufficient for both positivity and
complete positivity as shown in [?]. It is also easy to check that the range of the adjoint
map $\hat\Phi$ is included in the commutative subalgebra generated by $\{I, \sigma_1\}$. Thus, when $\tau \ne 0$,
$\Phi_{\alpha,\tau}$ is a non-unital CQ channel. Below we assume that $\alpha > 0$ and $\alpha^2 + \tau^2 \le 1$.
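On Bloch vectors the map above acts as $\mathbf{w} \mapsto (\alpha w_1, 0, \tau)^t$, so the largest output length over the Bloch ball is $\sqrt{\alpha^2+\tau^2}$. The sketch below illustrates this with arbitrary sample parameters; the helper names are hypothetical.

```python
import math


def phi_bloch(alpha, tau, w):
    """Bloch-vector action of Phi_{alpha,tau} on rho = (I + w.sigma)/2."""
    return (alpha * w[0], 0.0, tau)


def norm(v):
    """Euclidean length of a 3-vector."""
    return math.sqrt(sum(x * x for x in v))


# The output length is maximal at w = (+/-1, 0, 0), where it is sqrt(alpha^2 + tau^2).
w_extreme = (1.0, 0.0, 0.0)
inside_ball = norm(phi_bloch(0.6, 0.7, w_extreme))   # alpha^2 + tau^2 = 0.85
outside_ball = norm(phi_bloch(0.8, 0.7, w_extreme))  # alpha^2 + tau^2 = 1.13
```

The first parameter pair keeps every output state inside the Bloch ball, while the second pair violates $\alpha^2+\tau^2 \le 1$ and sends the pure state $\mathbf{w} = (1,0,0)^t$ outside it.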


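Theorem 6.2. Let $\Phi = \Phi_{\alpha,\tau}$ be a non-unital CQ channel with $\alpha, \tau$ specified above. Then

$$\eta^{\rm Tr}(\Phi) = \alpha, \tag{45a}$$
$$\eta^{\rm Riem}_{\max}(\Phi) = \frac{\alpha^2}{1-\tau^2}, \tag{45b}$$
$$\eta^{\rm Riem}_{(1+\sqrt{x})^2/4x}(\Phi) \ge \frac{\alpha^2\,(1+\sqrt{1-\tau^2})}{2(1-\tau^2)}, \tag{45c}$$
$$\eta^{\rm Riem}_{x^{-1/2}}(\Phi) \ge \frac{\alpha^2}{\sqrt{1-\tau^2}}, \tag{45d}$$
$$\eta^{\rm Riem}_{\rm BKM}(\Phi) \ge \frac{\alpha^2}{2\tau}\log\frac{1+\tau}{1-\tau}, \tag{45e}$$
$$\eta^{\rm Riem}_{\rm WY}(\Phi) = \frac{2\alpha^2}{1+\sqrt{1-\tau^2}}, \tag{45f}$$
$$\eta^{\rm Riem}_{\min}(\Phi) = \alpha^2. \tag{45g}$$

Moreover, for the extreme points $\kappa_s$ of $\mathcal{K}$,

$$\eta^{\rm Riem}_{\kappa_s}(\Phi) = \frac{\alpha^2}{1 - \left(\frac{1-s}{1+s}\right)^2\tau^2}, \qquad 0 \le s \le 1. \tag{46}$$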


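In the above, the function $(1+\sqrt{x})^2/4x \in \mathcal{K}$ is the dual of $\kappa_{\rm WY}$ in Example 4, i.e.,
$1/\kappa_{\rm WY}(x^{-1}) = (1+\sqrt{x})^2/4x$. The identities (45b) and (45g) are of course contained in (46).
By Proposition 5.5 (for CQ channels) together with (45b) and (45g) we observe that

$$\alpha^2 = \eta^{\rm Riem}_{\min}(\Phi) \le \eta^{\rm Riem}_\kappa(\Phi) \le \eta^{\rm Riem}_{\max}(\Phi) = \frac{\alpha^2}{1-\tau^2} \tag{47}$$

for every $\kappa \in \mathcal{K}$.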
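The interpolation between the minimal and maximal coefficients can be made concrete. The sketch below assumes the closed form $\eta^{\rm Riem}_{\kappa_s}(\Phi_{\alpha,\tau}) = \alpha^2/\bigl(1 - (\tfrac{1-s}{1+s})^2\tau^2\bigr)$ for the extreme functions $\kappa_s$, as derived in Appendix B.2.7, and evaluates it at sample parameters chosen only for illustration.

```python
def eta_extreme(alpha, tau, s):
    """Assumed closed form of the contraction coefficient for kappa_s, 0 <= s <= 1."""
    r = (1 - s) / (1 + s)
    return alpha ** 2 / (1 - (r * tau) ** 2)


alpha, tau = 0.6, 0.5  # sample parameters with alpha^2 + tau^2 <= 1

eta_max = eta_extreme(alpha, tau, 0.0)  # s = 0: alpha^2 / (1 - tau^2)
eta_min = eta_extreme(alpha, tau, 1.0)  # s = 1: alpha^2

# Values on a grid of s, decreasing from eta_max to eta_min.
values = [eta_extreme(alpha, tau, k / 10) for k in range(11)]
```

The endpoints reproduce $\alpha^2/(1-\tau^2)$ and $\alpha^2$, and the values decrease monotonically in $s$, so every $\kappa_s$ coefficient lies in the interval between the two.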

Although the bounds in the above theorem are sufficient to disprove two conjectures as
remarked below, we believe that they are optimal, i.e.,

Conjecture 6.3. Equality holds in (45c) through (45e) above.

In those cases in which we can compute $\eta^{\rm Riem}_\kappa(\Phi_{\alpha,\tau})$ exactly, the supremum is attained
when $\rho = \frac12 I$ or, equivalently, $\mathbf{w} = (0,0,0)^t$ and $A = \mathbf{y}\cdot\sigma$ with $\mathbf{y} = (y_1,0,0)^t$. Since the
output of $\Phi$ does not involve $w_2, w_3$ it is reasonable that there is no loss of generality in
choosing $w_2 = w_3 = 0$. And since the channel is symmetric around $w_1 = 0$ or $\rho = \frac12 I$, this
choice is also reasonable. But a proof for arbitrary choices of $\kappa$ does not seem easy.

If the above conjecture is true, then for these examples $\kappa_1 \le \kappa_2$ implies $\eta^{\rm Riem}_{\kappa_1}(\Phi) \le
\eta^{\rm Riem}_{\kappa_2}(\Phi)$, which is consistent with Proposition 5.5.

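Remark 6.4. It follows from (45a) and (45b) that the conjecture [38] that $\eta^{\rm Riem}_\kappa(\Phi) \le \eta^{\rm Tr}(\Phi)$
is false. Indeed, whenever $\alpha > 1 - \tau^2$, it follows that

$$\eta^{\rm Riem}_{\max}(\Phi) = \frac{\alpha^2}{1-\tau^2} > \alpha = \eta^{\rm Tr}(\Phi). \tag{48}$$

Since $\alpha > \alpha^2$, parameters can be found that are consistent with the CP condition but satisfy
(48). In fact, $\alpha = \tau = 1/\sqrt{2}$ will do.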
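The parameter choice in Remark 6.4 can be verified directly. The sketch assumes the closed forms $\eta^{\rm Tr}(\Phi_{\alpha,\tau}) = \alpha$ and $\eta^{\rm Riem}_{\max}(\Phi_{\alpha,\tau}) = \alpha^2/(1-\tau^2)$ from Theorem 6.2; the function name is illustrative.

```python
import math


def coefficients(alpha, tau):
    """Return (eta_Tr, eta_max) from the assumed closed forms alpha and alpha^2/(1 - tau^2)."""
    return alpha, alpha ** 2 / (1 - tau ** 2)


alpha = tau = 1 / math.sqrt(2)
eta_tr, eta_max = coefficients(alpha, tau)

# The CP condition alpha^2 + tau^2 <= 1 holds with equality, up to rounding.
cp_slack = 1 - (alpha ** 2 + tau ** 2)
```

With these values $\eta^{\rm Riem}_{\max}$ equals 1 up to rounding while $\eta^{\rm Tr} \approx 0.707$, so the maximal Riemannian coefficient strictly exceeds the trace-distance coefficient.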

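Remark 6.5. Although we do not know when equality holds in the bounds above, we have
sufficient information to conclude that the largest contraction coefficient is not necessarily
given by $\kappa(x) = x^{-1/2}$ as conjectured in [3T]. In particular, when $\alpha^2 + \tau^2 = 1$, the bound
(Theorem 4.4) implies that equality holds in (45d), so $\eta^{\rm Riem}_{x^{-1/2}}(\Phi) = \alpha <
1 = \eta^{\rm Riem}_{\max}(\Phi)$ in this case. This is foreshadowed by the fact that Proposition 5.5 says that
$\eta^{\rm Riem}_\kappa(\Phi)$ is monotone increasing in $\kappa \in \mathcal{K}$ for a CQ channel.

We also note that the inequality $\eta^{\rm Riem}_{x^{-1/2}}(\Phi) \le \eta^{\rm Tr}(\Phi)$ together with (45d) gives $\alpha^2/\sqrt{1-\tau^2} \le \alpha$, or $\alpha^2 \le 1 - \tau^2$, which is
equivalent to the CP condition $\alpha^2 + \tau^2 \le 1$.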


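The next theorem disproves the conjecture [38] that $\eta^{\rm Riem}_\kappa(\Phi) = \eta^{\rm RelEnt}_g(\Phi)$ for every
$\kappa \in \mathcal{K}$ and the corresponding $g \in \mathcal{G}_{\rm sym}$. For $0 \le s \le 1$ let $g_s \in \mathcal{G}$ be given as in (39), so
$(g_s)_{\rm sym} \in \mathcal{G}_{\rm sym}$ corresponds to the extreme $\kappa_s \in \mathcal{K}$.

Theorem 6.6. Let $\Phi = \Phi_{\alpha,\tau}$ be as above. If $4\tau^2 > (1-\alpha^2)(4-\alpha^2)$ (this is the case when
$\alpha^2 + \tau^2 = 1$ with $\alpha > 0$), then

$$\eta^{\rm RelEnt}_{g_s}(\Phi) \ge \eta^{\rm RelEnt}_{(g_s)_{\rm sym}}(\Phi) > \eta^{\rm Riem}_{\kappa_s}(\Phi) \tag{49}$$

for any $s \le 1$ sufficiently near 1 (depending on $\alpha, \tau$). Moreover, if $s > \sqrt{(5-\sqrt{21})/2} \approx 0.457$,
then $\eta^{\rm RelEnt}_{(g_s)_{\rm sym}}(\Phi_{\alpha,\tau}) > \eta^{\rm Riem}_{\kappa_s}(\Phi_{\alpha,\tau})$ when $\alpha^2 = 1 - \tau^2$ is sufficiently small.

Note that $(g_0)_{\rm sym}$ is $g_{\max}$ given in Example 2 and the three coefficients in (49) are equal
in particular when $s = 0$, as will be shown in Proposition 7.2 for general CPT maps.

Theorems 6.1, 6.2 and 6.6 will be proved in Appendix B.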


7 Results in Special Cases 

7.1 BKM metric 

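Theorem 7.1. For every CPT map $\Phi$, $\eta^{\rm Riem}_{\rm BKM}(\Phi) = \eta^{\rm RelEnt}_{\rm BKM}(\Phi)$.

Proof. By Theorem 5.2 it suffices to prove that

$$\eta^{\rm RelEnt}_{\rm BKM}(\Phi) \le \eta^{\rm Riem}_{\rm BKM}(\Phi) \tag{50}$$

for every CPT map $\Phi$. To do this, consider the line segment $\xi(t) = (1-t)\rho + t\gamma = \rho + t(\gamma-\rho)$,
$0 \le t \le 1$, joining $\rho, \gamma \in \mathcal{D}_d$. By using Daleckii and Krein's differential formula (see, e.g., [21,
Section 2.3]) we compute the derivative

$$\frac{d}{dt}H(\rho,\xi(t)) = \frac{d}{dt}\mathrm{Tr}\,\rho(\log\rho - \log\xi(t)) = -\mathrm{Tr}\,\rho\Bigl(\frac{d}{dt}\log\xi(t)\Bigr) = -\mathrm{Tr}\,\rho\,\log^{[1]}(L_{\xi(t)},R_{\xi(t)})(\gamma-\rho),$$

where $\log^{[1]}(x,y) = (\log x - \log y)/(x-y)$, the divided difference of $\log x$. We hence have

$$\frac{d}{dt}H(\rho,\xi(t)) = -\mathrm{Tr}\,\rho\,\Omega^{\rm BKM}_{\xi(t)}(\gamma-\rho) = -\langle\rho, \Omega^{\rm BKM}_{\xi(t)}(\gamma-\rho)\rangle \tag{51}$$

thanks to (22), and similarly

$$\frac{d}{dt}H(\gamma,\xi(t)) = -\langle\gamma, \Omega^{\rm BKM}_{\xi(t)}(\gamma-\rho)\rangle. \tag{52}$$

Therefore,

$$\frac{d}{dt}\{H(\rho,\xi(t)) - H(\gamma,\xi(t))\} = \langle\gamma-\rho, \Omega^{\rm BKM}_{\xi(t)}(\gamma-\rho)\rangle$$

so that

$$H_{\rm BKM}(\rho,\gamma) = H(\rho,\gamma) + H(\gamma,\rho)$$
$$= \{H(\rho,\xi(1)) - H(\gamma,\xi(1))\} - \{H(\rho,\xi(0)) - H(\gamma,\xi(0))\}$$
$$= \int_0^1 \frac{d}{dt}\{H(\rho,\xi(t)) - H(\gamma,\xi(t))\}\,dt$$
$$= \int_0^1 \langle\gamma-\rho, \Omega^{\rm BKM}_{\xi(t)}(\gamma-\rho)\rangle\,dt.$$

By replacing $\rho, \gamma$ with $\Phi(\rho), \Phi(\gamma)$ we also have

$$H_{\rm BKM}(\Phi(\rho),\Phi(\gamma)) = \int_0^1 \langle\Phi(\gamma-\rho), \Omega^{\rm BKM}_{\Phi(\xi(t))}(\Phi(\gamma-\rho))\rangle\,dt.$$

Since

$$\langle\Phi(\gamma-\rho), \Omega^{\rm BKM}_{\Phi(\xi(t))}(\Phi(\gamma-\rho))\rangle \le \eta^{\rm Riem}_{\rm BKM}(\Phi)\,\langle\gamma-\rho, \Omega^{\rm BKM}_{\xi(t)}(\gamma-\rho)\rangle, \qquad 0 \le t \le 1,$$

the desired inequality (50) follows. QED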


The differential expressions in (51) and (52) are quite special, so it seems that we cannot
apply the differential method as above for other $\kappa \in \mathcal{K}$. One may also consider the contraction
coefficient defined in (1),

$$\eta^{\rm RelEnt}(\Phi) = \sup_{\rho,\gamma\in\mathcal{D}_d,\,\rho\ne\gamma} \frac{H(\Phi(\rho),\Phi(\gamma))}{H(\rho,\gamma)},$$

with respect to the standard (non-symmetrized) relative entropy. One has $\eta^{\rm RelEnt}_{\rm BKM}(\Phi) \le
\eta^{\rm RelEnt}(\Phi)$ by the inequality in (16), but it is unknown whether both contraction coefficients
coincide or not.
7.2 Other special results


We collect here some additional special relations that may be of interest. 

Since the maximal metric has the special property (24) that every metric $\langle A, \Omega^{\max}_\rho(A)\rangle$ can
be realized as a quadratic relative entropy, Theorem 5.2 immediately implies the following:

Proposition 7.2. For every CPT map $\Phi$,

$$\eta^{\rm Riem}_{\max}(\Phi) = \eta^{\rm RelEnt}_{g_{\max}}(\Phi), \tag{53}$$

where $g_{\max} \in \mathcal{G}_{\rm sym}$ is given in Example 2.

The identities (53) and (38) show some asymmetry between the contraction properties of
$\kappa_{\max}$ and $\kappa_{\min}$; (53) holds for all quantum channels while (38) holds only for QC channels; see
the counter-example ($s = 1$) in Theorem 6.6 for a CQ channel.

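The functions $\kappa^{\rm WYD}_t$ given in (26) showed up through the representation of the
Wigner-Yanase-Dyson skew information in terms of monotone metrics, as described in [22, Section
2.4, Example 4.8]. Furthermore, as studied in [29], the trace functional of WYD
concavity/convexity [39, 2] is recovered by the quasi-entropy for $g^{(t)}$ given in (25) as follows:

$$J_t(K,A,B) = \langle KB^{1/2}, g^{(t)}(L_AR_B^{-1})KB^{1/2}\rangle = \frac{1}{t(1-t)}\bigl(\mathrm{Tr}\,K^*AK - \mathrm{Tr}\,K^*A^tKB^{1-t}\bigr)$$

for $A, B \in \mathbb{P}_d$ with a linear term. The $g^{(t)}$-divergence is

$$H_t(\rho,\gamma) = J_t(I,\rho,\gamma) = \frac{1 - \mathrm{Tr}\,\rho^t\gamma^{1-t}}{t(1-t)}.$$

Note that $H(\rho,\gamma) = \lim_{t\to1}H_t(\rho,\gamma)$ and $H(\gamma,\rho) = \lim_{t\to0}H_t(\rho,\gamma)$ so that $H_t(\rho,\gamma)$ forms a
one-parameter extension of the relative entropy. By Theorem 5.2 we have

Proposition 7.3. For every CPT map $\Phi$ and every $t \in (0,1)$,

$$\eta^{\rm Riem}_{\kappa^{\rm WYD}_t}(\Phi) \le \eta^{\rm RelEnt}_{g^{(t)}}(\Phi) = \sup_{\rho,\gamma\in\mathcal{D}_d,\,\rho\ne\gamma} \frac{1 - \mathrm{Tr}\,\Phi(\rho)^t\Phi(\gamma)^{1-t}}{1 - \mathrm{Tr}\,\rho^t\gamma^{1-t}}.$$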


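For $\kappa_{\rm WY}(x) = 4/(1+\sqrt{x})^2$ and $\kappa_{\min}(x) = 2/(1+x)$, Theorem 5.1 together with (27) and
(57) yields

$$\eta^{\rm Riem}_{\rm WY}(\Phi) = \sup_{\rho,\gamma\in\mathcal{D}_d,\,\rho\ne\gamma}\left[\frac{\arccos\mathrm{Tr}\,\Phi(\rho)^{1/2}\Phi(\gamma)^{1/2}}{\arccos\mathrm{Tr}\,\rho^{1/2}\gamma^{1/2}}\right]^2, \qquad \eta^{\rm Riem}_{\min}(\Phi) = \sup_{\rho,\gamma\in\mathcal{D}_d,\,\rho\ne\gamma}\left[\frac{\arccos F(\Phi(\rho),\Phi(\gamma))}{\arccos F(\rho,\gamma)}\right]^2. \tag{54}$$

Since

$$\left(\frac{\arccos t}{\arccos s}\right)^2 \le \frac{1-t}{1-s} \qquad\text{for } 0 \le s \le t \le 1,\ s < 1,$$

from (54) and (57) we also have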
Proposition 7.4. For every CPT map $\Phi$,


sup 1 F{^p),^(j)) ^ r ciBures(^y,<^(7)) l _ 

Pn&T^d, p¥^1 P, 76 X>d,p 7 ^ 7 L “Bures 1/5,7J J 
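The elementary inequality $(\arccos t/\arccos s)^2 \le (1-t)/(1-s)$ for $0 \le s \le t < 1$, which links the arccos ratios to the fidelity ratios, can be probed on a grid. This is only a numerical sanity check under the stated range of $s$ and $t$, not a proof, and the function name is illustrative.

```python
import math


def ratio_gap(s, t):
    """(1 - t)/(1 - s) - (arccos t / arccos s)^2 for 0 <= s <= t < 1."""
    return (1 - t) / (1 - s) - (math.acos(t) / math.acos(s)) ** 2


# Grid 0, 0.005, ..., 0.995 and all pairs with s <= t.
grid = [i / 200 for i in range(200)]
worst = min(ratio_gap(s, t) for s in grid for t in grid if s <= t)
```

The smallest gap on the grid is attained on the diagonal $s = t$, where both sides equal 1. This matches the fact that $\arccos^2(x)/(1-x)$ is decreasing on $[0,1)$.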
A Hiai-Petz Lemma


The next lemma was proved in [23] in a slightly different setting of the Riemannian manifold
$\mathbb{P}_d$ and its tangent space instead of $\mathcal{D}_d$ and $\mathbb{H}_d^0$ here, whose proof can work in the present
setting as well. The proof is provided below for completeness.

Lemma A.1. [24, Lemma 4.2] For every $\kappa \in \mathcal{K}$, $\rho \in \mathcal{D}_d$ and $A \in \mathbb{H}_d^0$,

$$\lim_{\varepsilon\searrow0} \frac{D_\kappa(\rho,\rho+\varepsilon A)}{\varepsilon} = \langle A, \Omega^\kappa_\rho(A)\rangle^{1/2}.$$

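Proof. First, recall that if $T$ is a linear operator on the Hilbert space $(\mathbb{M}_d, \langle\cdot,\cdot\rangle)$ represented
as the Schur multiplication by a matrix $(t_{ij}) \in \mathbb{M}_d$, then $T \ge 0$ if and only if $t_{ij} \ge 0$ for all
$i,j = 1,\dots,d$. We denote by $\mathbb{I}$ the identity operator on $(\mathbb{M}_d, \langle\cdot,\cdot\rangle)$, which is represented as
the Schur multiplication by the matrix of all entries equal to 1. To prove the lemma, we may
assume that $\rho = \mathrm{diag}(\lambda_1,\dots,\lambda_d)$. For brevity let $a_{ij} = \lambda_j^{-1}\kappa(\lambda_i\lambda_j^{-1})$ for $i,j = 1,\dots,d$ and
$a = \min_{i,j} a_{ij} > 0$. Since

$$\Omega^\kappa_\rho(X) = (a_{ij}) \circ X, \qquad X \in \mathbb{M}_d$$

(see, e.g., [22, Section 2.2]), it follows that $\Omega^\kappa_\rho \ge a\mathbb{I}$ as operators on $(\mathbb{M}_d, \langle\cdot,\cdot\rangle)$. For each
$\delta \in (0,a)$, since $\gamma \in \mathcal{D}_d \mapsto \Omega^\kappa_\gamma$ is continuous, there exists an $r_1 > 0$ such that if $\gamma \in \mathcal{D}_d$ and
$\|\gamma-\rho\|_2 < r_1$ then $\|\Omega^\kappa_\gamma - \Omega^\kappa_\rho\|_\infty < \delta$, where $\|\cdot\|_\infty$ denotes the operator norm for operators
on $(\mathbb{M}_d, \langle\cdot,\cdot\rangle)$. Furthermore, since $D_\kappa$ and $\|\cdot\|_2$ define the same topology on $\mathcal{D}_d$ (see [?,
Chapter IV, Proposition 3.5]), there exists an $r_0 > 0$ such that if $\gamma \in \mathcal{D}_d$ and $D_\kappa(\gamma,\rho) < r_0$
then $\|\gamma-\rho\|_2 < r_1$.

Now let $A \in \mathbb{H}_d^0$ and choose a sufficiently small $\varepsilon > 0$ so that $D_\kappa(\rho,\rho+\varepsilon A) < r_0$ and
$\varepsilon\|A\|_2 < r_1$. Let $\xi : [0,1] \to \mathcal{D}_d$ be any smooth curve from $\rho$ to $\rho+\varepsilon A$ such that $L_\kappa(\xi) < r_0$,
where $L_\kappa(\xi)$ is the length of $\xi$ with respect to the monotone metric induced by $\kappa$. Since
$D_\kappa(\xi(t),\rho) < r_0$ and so $\|\xi(t)-\rho\|_2 < r_1$ for all $0 \le t \le 1$, we have

$$L_\kappa(\xi) = \int_0^1 \langle\xi'(t), \Omega^\kappa_{\xi(t)}(\xi'(t))\rangle^{1/2}\,dt \ge \int_0^1 \|(\Omega^\kappa_\rho - \delta\mathbb{I})^{1/2}\xi'(t)\|_2\,dt$$
$$\ge \|(\Omega^\kappa_\rho - \delta\mathbb{I})^{1/2}(\varepsilon A)\|_2 = \varepsilon\Bigl(\sum_{i,j}(a_{ij}-\delta)|A_{ij}|^2\Bigr)^{1/2}.$$

In the above, note that $\Omega^\kappa_\rho - \delta\mathbb{I} \ge 0$ on $(\mathbb{M}_d, \langle\cdot,\cdot\rangle)$ since $\delta < a$. Also, the second inequality
above follows since $\int_0^1 \|(\Omega^\kappa_\rho - \delta\mathbb{I})^{1/2}\xi'(t)\|_2\,dt$ is the length of the curve $(\Omega^\kappa_\rho - \delta\mathbb{I})^{1/2}\xi(t)$, $0 \le
t \le 1$, from $(\Omega^\kappa_\rho - \delta\mathbb{I})^{1/2}\rho$ to $(\Omega^\kappa_\rho - \delta\mathbb{I})^{1/2}(\rho+\varepsilon A)$ in the Euclidean space $(\mathbb{H}_d, \|\cdot\|_2)$ and it is
shortest if $\xi$ is the segment between $\rho$ and $\rho+\varepsilon A$. Taking the infimum of $L_\kappa(\xi)$ yields

$$D_\kappa(\rho,\rho+\varepsilon A) \ge \varepsilon\Bigl(\sum_{i,j}(a_{ij}-\delta)|A_{ij}|^2\Bigr)^{1/2}.$$

On the other hand, let $\xi_0(t) = \rho + t\varepsilon A$. Since $\|\xi_0(t)-\rho\|_2 \le \varepsilon\|A\|_2 < r_1$ for $0 \le t \le 1$, we
have

$$D_\kappa(\rho,\rho+\varepsilon A) \le L_\kappa(\xi_0) = \int_0^1 \langle\varepsilon A, \Omega^\kappa_{\xi_0(t)}(\varepsilon A)\rangle^{1/2}\,dt$$
$$\le \int_0^1 \langle\varepsilon A, (\Omega^\kappa_\rho + \delta\mathbb{I})(\varepsilon A)\rangle^{1/2}\,dt = \|(\Omega^\kappa_\rho + \delta\mathbb{I})^{1/2}(\varepsilon A)\|_2 = \varepsilon\Bigl(\sum_{i,j}(a_{ij}+\delta)|A_{ij}|^2\Bigr)^{1/2}.$$

Since $\delta$ is arbitrary,

$$\lim_{\varepsilon\searrow0} \frac{D_\kappa(\rho,\rho+\varepsilon A)}{\varepsilon} = \Bigl(\sum_{i,j} a_{ij}|A_{ij}|^2\Bigr)^{1/2} = \langle A, \Omega^\kappa_\rho(A)\rangle^{1/2},$$

as desired. QED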


B Qubit Proofs 


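B.1 Unital qubit channels: Proof of Theorem 6.1

Writing $E = \frac12(I + \mathbf{w}\cdot\sigma)$ and $F = \frac12(I + \mathbf{x}\cdot\sigma)$ with $|\mathbf{w}| = |\mathbf{x}| = 1$, we find $EF = 0 \iff \mathbf{w} = -\mathbf{x}$,
in which case $E - F = \mathbf{w}\cdot\sigma$ so that by (35),

$$\eta^{\rm Tr}(\Phi_T) = \frac12\sup_{|\mathbf{w}|=1}\mathrm{Tr}\,|(T\mathbf{w})\cdot\sigma| = \sup_{|\mathbf{w}|=1}|T\mathbf{w}| = \|T\|_\infty. \tag{55}$$

Note that this implies that $\Phi_T$ is “non-scrambling” if and only if $\|T\|_\infty = 1$.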
It then follows from Theorems 15.11 and 15.31 that 

ir"(<i>T) = > iiTiiL 

for all K G E. Thus it suffices to show that < ||T||^ for all g G G- In fact, from 

the integral expression (fl^ it suffices to do this for ps ^ {x — l)^/(x -|- s), s > 0, as in (|39]l . 
for which we have 

(P.7) = Tr (p - 7) ^ (p-j). (56) 
Let $\rho = \frac12(I + \mathbf{w}\cdot\sigma)$ and $\gamma = \frac12(I + \mathbf{x}\cdot\sigma)$ with $|\mathbf{w}|, |\mathbf{x}| < 1$ and $\mathbf{y} = \mathbf{w} - \mathbf{x} \ne 0$ (which guarantee
$\rho, \gamma \in \mathcal{D}_2$ and $\rho \ne \gamma$). For $s = 0$ use (78) and (79) below to have

Hgo{p,l) = 2Tx{y■af{I + ^N■aY^ = 

1 — w 

Since <hr(/o) = ^[7+ {T'w)-a] and <hr(y'0') = {Ty)-cr, we have 


so that 


■^go(^r(p),^r(7)) ^ 1 - |wp _ 4|Typ ^ 1 - |wp _ [Typ 

Hgo{p,l) 4|y|2 l-|Tw|2 l-|rw|2 |y|2 


RelEnt 


(4>t) 


1 — |w| 


. . \Ty\^ 

■ ®^P Tl2“ 

iw|<ii-|Tw|2 y^o |yr 


ml 


Next, for s > 0 we use Lemma lB.41 with y 7^ 0, u = w — sx and v = w + sx. Note that 
since y = w — x = ^[(1 + 'S)u + (1 — 'S)v], y is orthogonal to u x v so that we can use the 
reduction by Lemma IB. 5 1 to conclude 


„MEn.(^ , ^ - (ry. ([(1 + - iru^ i + |ru)(ru| - |rv)(rv|)-‘ Ty) 

(y,([(l + s7-|uP]/ + |u)H-|v)(v|)-‘y) 


where the supremum is taken over 


{(u, V, y) : y = w — X, u = w — sx, v = w + sx, |w| < 1, |x| < 1} 

= |(u,v,y) : y = ^[(1 + s)v - (1 - s)u], |u +v| < 2, |u - v| < 2s 

Since relaxing the constraints can only increase the supremum, we will let y (7^ 0 ) S 
arbitrary, and since 


{(u, v) : |u + v| < 2, |u — v| < 2s} C Ms = {(u,v) : |u| < 1 + s, |v| < 1 + sj, 


we allow (u, v) G Ms- Then we hnd using Lemma IB. II that 


RelEnt 

hs 


{T*y, ([(1 + s)^ - |up] J+ |u)(u| - |v)(v|) T*y) 
(y, ([(1 + s)^ - |Tu| 2 ]/ + |ru)(Tu| - |rv)(rv|)y) 

|r*y| 2 [(l + s )2 - |u| 2 ] + |(y,ru )|2 - |(y,Tv )|2 
|yP[(l + - |Tu| 2 ] + Ky,Tu )|2 - Ky,rv )|2 ’ 


(57) 


Now consider 

|r*y|2(l + s)2-|up|r*yp + |yp|rup _ |r*yp |up 

|y|2(l + s)2 |y|2 (l + s)2 

Now if the quantity in [ ] on the RHS of ([581) is < 0, then the LHS of (f58]l is < |T*yp/|yp < 
I|7 "*||to = ||T||^ < 1. On the other hand, if it is > 0 then from |u| < 1 + s the LHS of (|^ 
is < |ru|7|u|2 < ||r||^ < 1. Thus, if we let a = \T*y\‘^{l + s)^ - |u|2|T*y|2 + lyplTup, 
b = |yp(l + s)^, and c = |yp|rup — |(y,Tu)p + |(y,Tv)p, then the fraction in (1571) is 
{a — c)/{b — c). Since o > c > 0 and a/b < ||T||oo < 1 by the analysis above, for every y 7^ 0 
and (u, v) G Ms we have 


which implies that < ||T||^. Therefore, < ||T||^ holds for all g ^ G- 

QED 


|Tu|^ \T*y\^ 


u 


|y|^ 


(58) 
B.2 Non-unital CQ qubit channels: Proof of Theorem 6.2


Before proving Theorem 6.2 observe that the effect of $\Phi = \Phi_{\alpha,\tau}$ on $P = \frac12(I + \mathbf{w}\cdot\sigma)$ is to
map $\mathbf{w} \mapsto (\alpha w_1, 0, \tau)^t$ and on $A = \mathbf{y}\cdot\sigma$ it is $\mathbf{y} \mapsto (\alpha y_1, 0, 0)^t$. There is no loss of generality in
simply using $P = I + \mathbf{w}\cdot\sigma$ whenever the factors of $\frac12$ will cancel.

B.2.1 Dobrushin coefficient 

As in the proof of (55),

$$\eta^{\rm Tr}(\Phi_{\alpha,\tau}) = \frac12\sup_{|\mathbf{w}|=1}\mathrm{Tr}\,|\alpha w_1\sigma_1| = \alpha$$

with the supremum attained at $\mathbf{w} = (1,0,0)^t$.


B.2.2 Maximal metric 

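It follows from (78) and (79) that for $P = I + \mathbf{w}\cdot\sigma$,

$$\langle\mathbf{y}\cdot\sigma, \Omega^{\max}_P(\mathbf{y}\cdot\sigma)\rangle = \mathrm{Tr}\,(\mathbf{y}\cdot\sigma)^2P^{-1} = \frac{2|\mathbf{y}|^2}{1-|\mathbf{w}|^2}$$

so that

$$\eta^{\rm Riem}_{\max}(\Phi_{\alpha,\tau}) = \sup_{|\mathbf{w}|<1}\sup_{\mathbf{y}\ne0}\frac{\alpha^2y_1^2}{|\mathbf{y}|^2}\,\frac{1-|\mathbf{w}|^2}{1-\alpha^2w_1^2-\tau^2} = \alpha^2\sup_{|w_1|<1}\frac{1-w_1^2}{1-\alpha^2w_1^2-\tau^2} = \frac{\alpha^2}{1-\tau^2} \le 1, \tag{59}$$

since it is easy to verify the last equality when $\alpha^2 + \tau^2 \le 1$.

B.2.3 $\kappa(x) = x^{-1/2}$

For $\kappa(x) = x^{-1/2}$ first observe that it follows from (79) and (80) that

$$(\mathbf{y}\cdot\sigma)(I + \mathbf{w}\cdot\sigma)^{-1/2} = \sqrt{\frac{\zeta}{2(1-|\mathbf{w}|^2)}}\,\Bigl\{-\frac{1}{\zeta}(\mathbf{w}\cdot\mathbf{y})I + \Bigl[\mathbf{y} + \frac{i}{\zeta}(\mathbf{w}\times\mathbf{y})\Bigr]\cdot\sigma\Bigr\} \tag{60}$$

with $\zeta = \zeta(|\mathbf{w}|) = 1 + \sqrt{1-|\mathbf{w}|^2}$ so that for $P = I + \mathbf{w}\cdot\sigma$

$$\mathrm{Tr}\,(\mathbf{y}\cdot\sigma)\,\Omega^{x^{-1/2}}_P(\mathbf{y}\cdot\sigma) = \mathrm{Tr}\,\bigl((\mathbf{y}\cdot\sigma)(I + \mathbf{w}\cdot\sigma)^{-1/2}\bigr)^2$$
$$= \frac{\zeta}{1-|\mathbf{w}|^2}\Bigl[\frac{1}{\zeta^2}(\mathbf{w}\cdot\mathbf{y})^2 + |\mathbf{y}|^2 - \frac{1}{\zeta^2}|\mathbf{w}\times\mathbf{y}|^2\Bigr]$$
$$= \frac{\zeta}{1-|\mathbf{w}|^2}\Bigl[\frac{1}{\zeta^2}|\mathbf{w}|^2|\mathbf{y}|^2(\cos^2\theta - \sin^2\theta) + |\mathbf{y}|^2\Bigr]$$
$$= \frac{|\mathbf{y}|^2}{(1-|\mathbf{w}|^2)\,\zeta}\bigl[|\mathbf{w}|^2\cos2\theta + \zeta^2\bigr], \tag{61}$$

where $\theta$ denotes the angle between $\mathbf{w}$ and $\mathbf{y}$. Thus

$$\frac{\langle\Phi(\mathbf{y}\cdot\sigma), \Omega^{x^{-1/2}}_{\Phi(P)}(\Phi(\mathbf{y}\cdot\sigma))\rangle}{\langle\mathbf{y}\cdot\sigma, \Omega^{x^{-1/2}}_P(\mathbf{y}\cdot\sigma)\rangle} = \frac{\alpha^2y_1^2}{|\mathbf{y}|^2}\;\frac{(1-|\mathbf{w}|^2)\,\zeta(|\mathbf{w}|)}{(1-\alpha^2w_1^2-\tau^2)\,\zeta(|(\alpha w_1,0,\tau)|)}\;\frac{(\alpha^2w_1^2+\tau^2)\cos2\tilde\theta + \zeta^2(|(\alpha w_1,0,\tau)|)}{|\mathbf{w}|^2\cos2\theta + \zeta^2(|\mathbf{w}|)} \tag{62}$$

with $\tilde\theta$ the angle between $(\alpha y_1,0,0)$ and $(\alpha w_1,0,\tau)$. The first ratio in this product is largest
when $\mathbf{y} = (y_1,0,0)$ and $\mathbf{y}$ enters only implicitly in the last one in $\theta, \tilde\theta$. Assuming $\mathbf{y} = (y_1,0,0)$
and $w_1 = 0$ gives $\theta = \tilde\theta = \pi/2$ and $\cos2\theta = \cos2\tilde\theta = -1$. Then the identity $\zeta^2(x) - x^2 =
2\sqrt{1-x^2}\,\zeta(x)$ can be used to simplify the RHS of (62) to give

$$\eta^{\rm Riem}_{x^{-1/2}}(\Phi_{\alpha,\tau}) \ge \sup_{\mathbf{w}=(0,w_2,w_3)} \alpha^2\,\frac{(1-|\mathbf{w}|^2)\,\zeta(|\mathbf{w}|)}{(1-\tau^2)\,\zeta(\tau)}\;\frac{2\sqrt{1-\tau^2}\,\zeta(\tau)}{2\sqrt{1-|\mathbf{w}|^2}\,\zeta(|\mathbf{w}|)}$$
$$= \sup_{\mathbf{w}=(0,w_2,w_3)} \alpha^2\,\frac{\sqrt{1-|\mathbf{w}|^2}}{\sqrt{1-\tau^2}} = \frac{\alpha^2}{\sqrt{1-\tau^2}},$$

which is equivalent to (45d).

We could have obtained this bound more easily by considering the special case $P = I$.
However, since the methods used to obtain (61) are used again later, there is some merit
in presenting the details in this relatively simple setting. For the special case, observe that
$\Phi(I) = I + \tau\sigma_3$ so that

$$\Phi(I)^{-1/2} = \begin{pmatrix} \frac{1}{\sqrt{1+\tau}} & 0 \\ 0 & \frac{1}{\sqrt{1-\tau}} \end{pmatrix} \quad\text{and}\quad \sigma_1\Phi(I)^{-1/2}\sigma_1\Phi(I)^{-1/2} = \frac{1}{\sqrt{1-\tau^2}}\,I,$$

from which it follows that

$$\eta^{\rm Riem}_{x^{-1/2}}(\Phi_{\alpha,\tau}) \ge \sup_{\mathbf{y}\ne0}\,\alpha^2\,\frac{y_1^2}{|\mathbf{y}|^2}\,\frac{1}{\sqrt{1-\tau^2}} = \frac{\alpha^2}{\sqrt{1-\tau^2}}. \tag{63}$$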


B.2.4 BKM metric 

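For $\kappa_{\rm BKM}(x) = (\log x)/(x-1)$ we can repeat some of the strategy above to get a lower bound.
It follows from (22) that

$$\Omega^{\rm BKM}_P(A) = \int_0^\infty \frac{1}{P+uI}\,A\,\frac{1}{P+uI}\,du.$$

Then using (78) with $a = 1+u$ and (79), we find for $A = \mathbf{y}\cdot\sigma$ and $P = I + \mathbf{w}\cdot\sigma$

$$(\mathbf{y}\cdot\sigma)\,\frac{1}{(1+u)I + \mathbf{w}\cdot\sigma} = \frac{-(\mathbf{w}\cdot\mathbf{y})I + [(1+u)\mathbf{y} + i\,\mathbf{w}\times\mathbf{y}]\cdot\sigma}{(1+u)^2 - |\mathbf{w}|^2}$$

from which it follows that

$$\mathrm{Tr}\,(\mathbf{y}\cdot\sigma)\frac{1}{(1+u)I + \mathbf{w}\cdot\sigma}(\mathbf{y}\cdot\sigma)\frac{1}{(1+u)I + \mathbf{w}\cdot\sigma} = \mathrm{Tr}\,\Bigl[(\mathbf{y}\cdot\sigma)\frac{1}{(1+u)I + \mathbf{w}\cdot\sigma}\Bigr]^2$$
$$= \frac{2}{[(1+u)^2 - |\mathbf{w}|^2]^2}\Bigl[|\mathbf{w}\cdot\mathbf{y}|^2 + (1+u)^2|\mathbf{y}|^2 - |\mathbf{w}\times\mathbf{y}|^2\Bigr]$$
$$= \frac{2|\mathbf{y}|^2}{[(1+u)^2 - |\mathbf{w}|^2]^2}\bigl[(1+u)^2 + |\mathbf{w}|^2\cos2\theta\bigr], \tag{64}$$

where for the second equality above we have used $\mathbf{y}\cdot(\mathbf{w}\times\mathbf{y}) = 0$ and $\theta$ is the angle between
$\mathbf{w}$ and $\mathbf{y}$. For $\theta = \pi/2$, $\cos2\theta = -1$ so that (64) becomes $2|\mathbf{y}|^2/[(1+u)^2 - |\mathbf{w}|^2]$. Then since
the integral

$$\int_0^\infty \frac{1}{(1+u)^2 - |\mathbf{w}|^2}\,du = \frac{1}{2|\mathbf{w}|}\log\frac{1+|\mathbf{w}|}{1-|\mathbf{w}|}$$

is elementary, we can conclude

$$\eta^{\rm Riem}_{\rm BKM}(\Phi_{\alpha,\tau}) \ge \sup_{w_1=0,\ \mathbf{y}\cdot\mathbf{w}=0} \alpha^2\,\frac{y_1^2}{|\mathbf{y}|^2}\;\frac{\frac{1}{2\tau}\log\frac{1+\tau}{1-\tau}}{\frac{1}{2|\mathbf{w}|}\log\frac{1+|\mathbf{w}|}{1-|\mathbf{w}|}}$$
$$= \frac{\alpha^2}{2\tau}\log\frac{1+\tau}{1-\tau}\,\sup_{\mathbf{w}=(0,w_2,w_3)} \frac{2|\mathbf{w}|}{\log\frac{1+|\mathbf{w}|}{1-|\mathbf{w}|}} = \frac{\alpha^2}{2\tau}\log\frac{1+\tau}{1-\tau},$$

since the inequality $\log\frac{1+x}{1-x} \ge 2x$ implies that

$$\sup_{|\mathbf{w}|<1} \frac{2|\mathbf{w}|}{\log\frac{1+|\mathbf{w}|}{1-|\mathbf{w}|}} = \lim_{|\mathbf{w}|\to0} \frac{2|\mathbf{w}|}{\log\frac{1+|\mathbf{w}|}{1-|\mathbf{w}|}} = 1.$$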

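This appears to be a reasonable bound when $\tau$ is small. Although it might appear to blow
up when $\tau \to 1$, the CP condition $\alpha^2 + \tau^2 \le 1$ implies that if $\tau \to 1$ then $\alpha \to 0$. Since
$1 < \frac{1}{2\tau}\log\frac{1+\tau}{1-\tau} < \frac{1}{1-\tau^2}$ for all $\tau \in (0,1)$, we can conclude that $\alpha^2 < \eta^{\rm Riem}_{\rm BKM}(\Phi) \le \frac{\alpha^2}{1-\tau^2}$,
consistent with (47).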
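The two-sided estimate $1 < \frac{1}{2\tau}\log\frac{1+\tau}{1-\tau} < \frac{1}{1-\tau^2}$ can be checked on a grid of $\tau \in (0,1)$. The sketch is a numerical illustration only; the helper names and the sample parameters are not from the paper.

```python
import math


def bkm_factor(tau):
    """(1/(2 tau)) * log((1 + tau)/(1 - tau)), i.e. artanh(tau)/tau, for 0 < tau < 1."""
    return math.log((1 + tau) / (1 - tau)) / (2 * tau)


def bkm_lower_bound(alpha, tau):
    """Lower bound alpha^2 * bkm_factor(tau) on the BKM contraction coefficient."""
    return alpha ** 2 * bkm_factor(tau)


taus = [k / 100 for k in range(1, 100)]
holds = all(1 < bkm_factor(t) < 1 / (1 - t ** 2) for t in taus)

# Sample parameters with alpha^2 + tau^2 <= 1.
alpha, tau = 0.6, 0.5
bound = bkm_lower_bound(alpha, tau)
```

For the sample parameters the bound lies strictly between $\alpha^2 = 0.36$ and $\alpha^2/(1-\tau^2) = 0.48$.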


B.2.5 Dual of WY metric 

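For the WY function $\kappa_{\rm WY}(x) = 4/(1+\sqrt{x})^2$, the dual function $1/\kappa_{\rm WY}(x^{-1}) = (1+\sqrt{x})^2/4x$
gives the operator

$$\Omega_P = \frac14\bigl(L_P^{-1/2} + R_P^{-1/2}\bigr)^2.$$

To proceed as above, for $P = I + \mathbf{w}\cdot\sigma$ we first find

$$\bigl(L_P^{-1/2} + R_P^{-1/2}\bigr)(\mathbf{y}\cdot\sigma) = (I + \mathbf{w}\cdot\sigma)^{-1/2}(\mathbf{y}\cdot\sigma) + (\mathbf{y}\cdot\sigma)(I + \mathbf{w}\cdot\sigma)^{-1/2}$$
$$= \sqrt{\frac{2\zeta(|\mathbf{w}|)}{1-|\mathbf{w}|^2}}\,\Bigl[-\frac{1}{\zeta(|\mathbf{w}|)}(\mathbf{y}\cdot\mathbf{w})I + \mathbf{y}\cdot\sigma\Bigr], \tag{65}$$

where we have used (60) and the fact that the $\mathbf{w}\times\mathbf{y}$ terms have the opposite sign and cancel.
As in Sections B.2.3 and B.2.4 the resulting expressions are difficult to deal with unless we
make the simplifying assumption $\mathbf{y} = (y_1,0,0)^t$ and $w_1 = 0$ which yields the lower bound

$$\eta^{\rm Riem}_{(1+\sqrt{x})^2/4x}(\Phi_{\alpha,\tau}) \ge \alpha^2\,\frac{1+\sqrt{1-\tau^2}}{1-\tau^2}\,\sup_{\mathbf{w}=(0,w_2,w_3)} \frac{1-|\mathbf{w}|^2}{1+\sqrt{1-|\mathbf{w}|^2}} = \alpha^2\,\frac{1+\sqrt{1-\tau^2}}{2(1-\tau^2)}.$$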
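B.2.6 Minimal or Bures metric


For the smallest function $\kappa_{\min}(x) = 2/(1+x)$ in $\mathcal{K}$ we begin by using Lemma B.3 to
conclude that for $P = I + \mathbf{w}\cdot\sigma$

$$\mathrm{Tr}\,(\mathbf{y}\cdot\sigma)\,\frac{2}{L_P + R_P}\,(\mathbf{y}\cdot\sigma) = 2\,\mathbf{y}\cdot\Bigl[\mathbf{y} + \frac{(\mathbf{w}\cdot\mathbf{y})\,\mathbf{w}}{1-|\mathbf{w}|^2}\Bigr] = 2\,\frac{|\mathbf{y}|^2(1-|\mathbf{w}|^2) + (\mathbf{w}\cdot\mathbf{y})^2}{1-|\mathbf{w}|^2}.$$

Thus

$$\eta^{\rm Riem}_{\min}(\Phi) = \sup_{|\mathbf{w}|<1}\sup_{\mathbf{y}\ne0}\frac{\alpha^2y_1^2(1-|\mathbf{w}|^2)}{|\mathbf{y}|^2(1-|\mathbf{w}|^2) + (\mathbf{w}\cdot\mathbf{y})^2}\;\frac{1-\tau^2}{1-\tau^2-\alpha^2w_1^2}.$$

For fixed $\mathbf{w}$ we can optimize the first term, which depends only on the ratios $y_2/y_1$ and
$y_3/y_1$, directly or use Lemma B.2 to conclude that the maximum is achieved when $\mathbf{y} =
(1, -w_1w_2/(1-w_1^2), -w_1w_3/(1-w_1^2))^t$. In this case,

$$\frac{y_1^2(1-|\mathbf{w}|^2)}{|\mathbf{y}|^2(1-|\mathbf{w}|^2) + (\mathbf{w}\cdot\mathbf{y})^2} = 1 - w_1^2.$$

Thus

$$\eta^{\rm Riem}_{\min}(\Phi) = \alpha^2(1-\tau^2)\sup_{|w_1|<1}\frac{1-w_1^2}{1-\tau^2-\alpha^2w_1^2}.$$

When the CP condition $\alpha^2 + \tau^2 \le 1$ holds, it is elementary to verify that

$$\frac{1-w_1^2}{1-\tau^2-\alpha^2w_1^2} \le \frac{1}{1-\tau^2} \tag{66}$$

so that the supremum is achieved when $w_1 = 0$, and

$$\eta^{\rm Riem}_{\min}(\Phi) = \alpha^2. \tag{67}$$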


B.2.7 Extreme points 

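The extreme functions have the form

$$\kappa_s(x) = \frac{1+s}{2}\Bigl(\frac{1}{x+s} + \frac{1}{1+sx}\Bigr),$$

for which the corresponding operator is

$$\Omega^{\kappa_s}_P = \frac{1+s}{2}\Bigl(\frac{1}{L_P+sR_P} + \frac{1}{R_P+sL_P}\Bigr).$$

Observe that for any function $h : (0,\infty) \to \mathbb{R}$ and $X \in \mathbb{M}_d$,

$$[R_\rho^{-1}h(L_\rho R_\rho^{-1})(X)]^* = L_\rho^{-1}h(R_\rho L_\rho^{-1})(X^*) = R_\rho^{-1}\tilde h(L_\rho R_\rho^{-1})(X^*),$$

where $\tilde h(x) = x^{-1}h(x^{-1})$. Using this for the function $h(x) = (1+s)/(x+s)$ for which
$\tilde h(x) = (1+s)/(1+sx)$, we have for $A = A^*$

$$\langle A, \Omega^{\kappa_s}_P(A)\rangle = \Bigl\langle A, \frac{1+s}{L_P+sR_P}(A)\Bigr\rangle = \Bigl\langle A, \frac{1+s}{R_P+sL_P}(A)\Bigr\rangle. \tag{68}$$

Now we let $P = I + \mathbf{w}\cdot\sigma$, $A = \mathbf{y}\cdot\sigma$ and apply Lemma B.4 with $\mathbf{u} = (1-s)\mathbf{w}$, $\mathbf{v} = (1+s)\mathbf{w}$
to obtain

$$\langle A, \Omega^{\kappa_s}_P(A)\rangle = 2(1+s)^2\bigl\langle\mathbf{y}, [\xi_s(|\mathbf{w}|^2)I - 4s|\mathbf{w}\rangle\langle\mathbf{w}|]^{-1}\mathbf{y}\bigr\rangle = \frac{2(1+s)^2}{\xi_s(|\mathbf{w}|^2)}\Bigl[|\mathbf{y}|^2 + \frac{4s(\mathbf{w}\cdot\mathbf{y})^2}{(1+s)^2(1-|\mathbf{w}|^2)}\Bigr],$$

where we have used (81) and

$$\xi_s(x) = (1+s)^2 - (1-s)^2x = (1+s)^2(1-x) + 4sx. \tag{69}$$

Thus

$$\frac{\langle\Phi(A), \Omega^{\kappa_s}_{\Phi(P)}(\Phi(A))\rangle}{\langle A, \Omega^{\kappa_s}_P(A)\rangle} = \frac{(1-|\mathbf{w}|^2)\,\xi_s(|\mathbf{w}|^2)}{|\mathbf{y}|^2(1+s)^2(1-|\mathbf{w}|^2) + 4s(\mathbf{w}\cdot\mathbf{y})^2}\;\frac{\alpha^2y_1^2\,[(1+s)^2(1-\alpha^2w_1^2-\tau^2) + 4s\alpha^2w_1^2]}{(1-\alpha^2w_1^2-\tau^2)\,\xi_s(\alpha^2w_1^2+\tau^2)}$$

so that

$$\eta^{\rm Riem}_{\kappa_s}(\Phi) = \sup_{|\mathbf{w}|<1}\frac{(1-|\mathbf{w}|^2)\,\xi_s(|\mathbf{w}|^2)}{(1-\alpha^2w_1^2-\tau^2)\,\xi_s(\alpha^2w_1^2+\tau^2)}\,\sup_{\mathbf{y}\ne0}\frac{\alpha^2y_1^2\,[(1+s)^2(1-\alpha^2w_1^2-\tau^2) + 4s\alpha^2w_1^2]}{|\mathbf{y}|^2(1+s)^2(1-|\mathbf{w}|^2) + 4s(\mathbf{w}\cdot\mathbf{y})^2}. \tag{70}$$

We first consider $\sup_{\mathbf{y}}$ with $\mathbf{w}$ fixed. This term depends only on the ratios $y_2/y_1$, $y_3/y_1$ so
that there is no loss of generality in assuming $y_1 = 1$, in which case only the denominator
depends on $\mathbf{y}$ and we consider instead its minimum, i.e., we seek

$$\min_{y_2,y_3}\bigl[(1 + y_2^2 + y_3^2)(1+s)^2(1-|\mathbf{w}|^2) + 4s(w_1 + w_2y_2 + w_3y_3)^2\bigr]$$

which is found in Lemma B.2 with coefficients $(1+s)^2(1-|\mathbf{w}|^2)$ and $4s$. The minimum is

$$\frac{(1+s)^2(1-|\mathbf{w}|^2)\,\xi_s(|\mathbf{w}|^2)}{\xi_s(|\mathbf{w}|^2) - 4sw_1^2}. \tag{71}$$

Inserting (71) into (70) yields

$$\eta^{\rm Riem}_{\kappa_s}(\Phi) = \alpha^2\sup_{|\mathbf{w}|<1}\frac{[(1+s)^2(1-\alpha^2w_1^2-\tau^2) + 4s\alpha^2w_1^2]\,[\xi_s(|\mathbf{w}|^2) - 4sw_1^2]}{(1-\alpha^2w_1^2-\tau^2)\,\xi_s(\alpha^2w_1^2+\tau^2)\,(1+s)^2}.$$

The only term in the expression above which involves $\mathbf{w}$ rather than $w_1$ is $\xi_s(|\mathbf{w}|^2)
= (1+s)^2 - (1-s)^2|\mathbf{w}|^2$, which is largest when $|\mathbf{w}|$ is smallest, i.e., $|\mathbf{w}| = |w_1|$ or, equivalently,
$\mathbf{w} = (w_1,0,0)$. Thus we find

$$\eta^{\rm Riem}_{\kappa_s}(\Phi) = \alpha^2\sup_{|w_1|<1}\frac{(1+s)^2(1-\alpha^2w_1^2-\tau^2) + 4s\alpha^2w_1^2}{(1+s)^2(1-\alpha^2w_1^2-\tau^2) + 4s(\alpha^2w_1^2+\tau^2)}\;\frac{1-w_1^2}{1-\alpha^2w_1^2-\tau^2}. \tag{72}$$

For $s = 0, 1$ this reduces to the expression for the maximal and minimal functions, (59) and
(66), respectively. As in (66) the second factor is largest when $w_1 = 0$. The first factor can
be written as

$$\frac{R}{R + 4s\tau^2} = 1 - \frac{4s\tau^2}{R + 4s\tau^2},$$

which is largest when

$$R = (1+s)^2(1-\alpha^2w_1^2-\tau^2) + 4s\alpha^2w_1^2 = (1+s)^2(1-\tau^2) - (1-s)^2\alpha^2w_1^2$$

is largest, which also occurs when $w_1 = 0$. Using these observations in (72) we can conclude
that

$$\eta^{\rm Riem}_{\kappa_s}(\Phi) = \frac{(1+s)^2\alpha^2}{(1+s)^2 - (1-s)^2\tau^2} = \frac{\alpha^2}{1 - \left(\frac{1-s}{1+s}\right)^2\tau^2}. \tag{73}$$

When $s = 0, 1$ we recover the expressions (59) and (67).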
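The last optimization step can be checked numerically. The sketch below assumes the one-variable expression (72) and the closed form (73) as written in this section, scans $w_1$ over a grid, and compares the maximum with (73). The parameter values are arbitrary choices satisfying $\alpha^2+\tau^2 \le 1$, and the function names are illustrative.

```python
def expr72(alpha, tau, s, w1):
    """Value of the expression inside the supremum in (72), times alpha^2."""
    a2w = (alpha * w1) ** 2
    base = (1 + s) ** 2 * (1 - a2w - tau ** 2)
    first = (base + 4 * s * a2w) / (base + 4 * s * (a2w + tau ** 2))
    second = (1 - w1 ** 2) / (1 - a2w - tau ** 2)
    return alpha ** 2 * first * second


def closed73(alpha, tau, s):
    """Closed form (73): (1+s)^2 alpha^2 / ((1+s)^2 - (1-s)^2 tau^2)."""
    return (1 + s) ** 2 * alpha ** 2 / ((1 + s) ** 2 - (1 - s) ** 2 * tau ** 2)


alpha, tau, s = 0.6, 0.5, 0.3
grid = [k / 100 for k in range(-99, 100)]  # w1 in (-1, 1), including w1 = 0
numeric_sup = max(expr72(alpha, tau, s, w1) for w1 in grid)
closed = closed73(alpha, tau, s)
```

On this grid the maximum is attained at $w_1 = 0$ and agrees with the closed form up to rounding.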


B.2.8 Wigner-Yanase metric 


Although we have had to make simplifying assumptions to obtain lower bounds for all but 
the extremal n, it is quite remarkable that we can obtain an exact expression in the case of 
the Wigner-Yanase function k-wy{x) = 4/(1 -|- Then 


(VAF + 

For P = I + w-a, using (1801) and Lemma lB.31 we can write 


\/ Lp + yj Rp 


(y^) = \l^ 


Cw • y 

^2 — |w|- 


■7+ y + 


(w • y)w 
^2 _ |.^^|2 


■a 


with C = C(|w|) = 1 + -s/l - I wp. Therefore, 


Tr(ya) 


{y/Lp + y/Rp)^ 


(ya) = Tr 


p/Lp + y/Rp 
■ C'(w.y)2 


(yo-) 


C (C^ “ |wp )2 

4 


, ,9 (w • y)2 (w • y)2|w 

+ |y| +2^-^ + 


c 


2 , / n 2 - |w 


^2 _ |.^|2 ^(^2 _ | w | 2)2 

2 


|y| +(w-y)" 


(C^ — |wP )2 
-1 


= 4(y, [C/- (2 - C ^)|w)(w|] y), 

where the last equality is the key to our ability to evaluate exactly. To obtain this, 

one can apply |w|2 = ^(2 — () and (IHT]) to see that 


■ 3(2 - |w |2 , , 


2 C - 1 , , , , 

7 + 7^:77 - 77 W |w)(w| 


2 C(C - 1)2 

= [C7- (2 - C"^)|w)(w|]"\ 


Then with T = diag(Q:,0,0), w = (Q;u;i,0,r)* and ( = ((|w|), we use Lemma rB.il to obtain 

{Ty, [C7 - (2 - C"^)|w)(w|]"Vy) 


rywY^(4>) = sup sup 


|w|<iy7^0 (y, [C7-(2 - C"^)|w)(w|]" y) 
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{Ty,[CI - {2 - C ^)|w)(w|]ry) 

= sup sup ^^ 
|w|<iy^o (y, [C/-(2-C"^)|w)(w|]y> 

C|ry|2-(2-c-')Kw,ry)|2 

= sup sup - 

|w|<iy7^o C|yP - (2 - C"^)Kw,y)|2 

[C - wf{2 - C~^)]yj 


= sup sup 


|w|<i (j/i,t/3)7^(0,0) CiVi + yi) - (2 - C ^)iawiyi + rys)^ 
Since the denominator of the last ratio is 

2 


[C-r^( 2 -C-^)] (^ys- 

it follows that 


aTWi{2-C ^)yi 


, - {a'^wj + t‘^){2 C - 1) 2 

“I -T"--yij 


<-T"(2-c-‘)y c-t2(2-c-‘) 

"[c-w;( 2 -c-‘)][c-t"( 2 -c-')] 


a 


|w|<i e-{a^wl + T^){2C-l) 

Then (^ = 1 + y^l — depends on wi only and 


c - wl {2 - c^) = 1 + 


wp — tCi 


2 - 


I-VT^ 


W 


W 


= 2(l-u;f)- 


2 ^ {wl+w^){l - v^l - |w|2) 


w 


which obviously takes the maximum 2(1 — wf) when W 2 = W 3 = 0. Therefore, 


ywY“(^«,r) = sup 


2a^(l — wf) 1 + y^l — — a^wf — t‘^\2 — ^ 




|«)i|<i (1 + -^1 — — a^tcf) — {a^w\ + r^) (l + 2 y^l — r^ — a^wl) 


= sup 


a^(l — w\) [ 2(1 — r^) — a^wl + 2(1 — r^)y^l — — a^w'l 


(1 — — a‘^Wi)[l + Y^l — — a'^wf)" 


< 


1 — wi 


sup 

|uil|<l 


1 — — a'^wi 


sup 

|u;i|<l 


2(1 — T^)(l + -^1 — — a'^w'f) — a^w\ 


(1 + Y^l — — a^wl) 

As in ()66p the hrst supremum is attained when wi = 0. For the second, let p = 1 — and 

X = a^Wi G [0,a^) where < p < 1, and observe that the ratio can be written as 

2p(l + y/p- x) - X 


= p+ (1 - p)- 


p — X 


(1 + ^p - xY (1 + 1/p^^)^’ 

which is maximized when x = 0 (i.e., tci =0). Thus we conclude 

2(1 — r^)(l + y^l — — a^rcf) — a^w\ ^ ^ p 2(1 — r^'' 


sup - ^ 

hi|<i (1 + -^1 - - a^w'l) 

Combining the two suprema yields 

^Riem 


VWr{^o.,r) < 


= p + (1 - p) 


2 a^ 


{i + ^pf i + vr^' 


1 + VT^ 


r2 ’ 


It is straightforward to see that the special case y = (pi,0,0)* and w = 0 yields the reverse 
inequality. 

The proof of Theorem 16.21 is now complete. QED 
B.3 Proof of Theorem 6.6


For P = I + w-a, Q = I + x-a and 0 < s < 1, by (1561) and Lemma lB.51 we have 


HgAQ.P) 

2(1 + s) 


■Tr (yo-)- 


-(yo-) 


2(1 + s) ^ ' Lp sRq 

(y, [{(l + s) 2 _|u| 2 }/+|u)(u|- 



where y = w — x, u = w — sx and v = w + sx, and we note that y is orthogonal to u x v. 
The formula for Hg^{P,Q) is similar with w, x interchanged. Since the first inequality of 
(I49p holds in general by (1161) . we will estimate ^ por this we take 

w = (tci, 0 , 0 )* and x = 0 for simplicity, for which we have 


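\[
\frac{H_{g_s}(Q,P)}{2(1+s)} = \frac{w_1^2}{(1+s)^2-w_1^2},
\qquad
\frac{H_{g_s}(P,Q)}{2(1+s)} = \frac{w_1^2}{(1+s)^2-s^2w_1^2}.
\tag{74}
\]
Since $\Phi(P) = I + \mathbf{w}\cdot\sigma$ and $\Phi(Q) = I + \mathbf{x}\cdot\sigma$ where $\mathbf{w} = (\alpha w_1,0,\tau)^t$ and $\mathbf{x} = (0,0,\tau)^t$, we have the expression
\[
\frac{H_{g_s}(\Phi(Q),\Phi(P))}{2(1+s)}
= \Bigl\langle \mathbf{y},\bigl[\{(1+s)^2-|\mathbf{u}|^2\}I+|\mathbf{u}\rangle\langle\mathbf{u}|-|\mathbf{v}\rangle\langle\mathbf{v}|\bigr]^{-1}\mathbf{y}\Bigr\rangle,
\tag{75}
\]
where $\mathbf{y} = \mathbf{w}-\mathbf{x} = (\alpha w_1,0,0)^t$, $\mathbf{u} = \mathbf{w}-s\mathbf{x} = (\alpha w_1,0,(1-s)\tau)^t$ and $\mathbf{v} = \mathbf{w}+s\mathbf{x} = (\alpha w_1,0,(1+s)\tau)^t$. The matrix form of the operator inside $[\ ]^{-1}$ of (75) is
\[
\begin{pmatrix}
\xi_s(\tau^2)-\alpha^2w_1^2 & 0 & -2s\alpha\tau w_1 \\
0 & \xi_s(\tau^2)-\alpha^2w_1^2 & 0 \\
-2s\alpha\tau w_1 & 0 & \xi_s(\tau^2)-4s\tau^2-\alpha^2w_1^2
\end{pmatrix},
\]
where $\xi_s(\cdot)$ is in (69). The $(1,1)$-entry of the inverse of this matrix is $[\xi_s(\tau^2)-4s\tau^2-\alpha^2w_1^2]/\det$, where $\det$ is the determinant of the $2\times 2$ matrix of the first and the third rows and columns. Therefore, the exact form of (75) is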


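\[
\frac{H_{g_s}(\Phi(Q),\Phi(P))}{2(1+s)}
= \frac{\alpha^2w_1^2\,[\xi_s(\tau^2)-\alpha^2w_1^2-4s\tau^2]}
{[\xi_s(\tau^2)-\alpha^2w_1^2]\,[\xi_s(\tau^2)-\alpha^2w_1^2-4s\tau^2]-4s^2\alpha^2\tau^2w_1^2}.
\tag{76}
\]
A similar computation with $\mathbf{w}$, $\mathbf{x}$ interchanged yields
\[
\frac{H_{g_s}(\Phi(P),\Phi(Q))}{2(1+s)}
= \frac{\alpha^2w_1^2\,[\xi_s(\tau^2)-s^2\alpha^2w_1^2-4s\tau^2]}
{[\xi_s(\tau^2)-s^2\alpha^2w_1^2]\,[\xi_s(\tau^2)-s^2\alpha^2w_1^2-4s\tau^2]-4s^2\alpha^2\tau^2w_1^2}.
\tag{77}
\]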


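We define
\[
H(s) := \lim_{w_1\nearrow 1}\frac{H_{g_s}(Q,P)+H_{g_s}(P,Q)}{2(1+s)},
\qquad
\tilde H(s) := \lim_{w_1\nearrow 1}\frac{H_{g_s}(\Phi(Q),\Phi(P))+H_{g_s}(\Phi(P),\Phi(Q))}{2(1+s)}.
\]
Since the contraction coefficient of $\Phi$ for this quantity is at least $\tilde H(s)/H(s)$ for every $s \in [0,1]$, we may compare $\tilde H(s)/H(s)$ with $\eta^{\rm Riem}_{\kappa_s}(\Phi)$ for $s$ near 1. For this we observe by (74), (76) and (77) that $H(1) = 2/3$ and
\[
\tilde H(1)
= \frac{2\alpha^2(4-\alpha^2-4\tau^2)}{(4-\alpha^2)(4-\alpha^2-4\tau^2)-4\alpha^2\tau^2}
= \frac{2\alpha^2(4-\alpha^2-4\tau^2)}{(4-\alpha^2)^2-16\tau^2}.
\]
Now assume that $4\tau^2 > (1-\alpha^2)(4-\alpha^2)$ (in particular, $\tau^2 > 0$). Since $(4-\alpha^2)^2-16\tau^2 > 0$ (thanks to $\alpha^2+\tau^2 \le 1$ and $\tau^2 < 1$) and
\[
3(4-\alpha^2-4\tau^2)-[(4-\alpha^2)^2-16\tau^2] = 4\tau^2-(1-\alpha^2)(4-\alpha^2) > 0,
\]
it follows that $\tilde H(1)/H(1) > \alpha^2$. From the continuity of the $s$-dependence of $\tilde H(s)/H(s)$ and of the quantity in (73), we arrive at the first assertion stated in the theorem.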

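For the second assertion, when $\alpha^2+\tau^2 = 1$, a tedious computation gives
\[
\frac{\tilde H(s)}{H(s)}
= \frac{\alpha^2\,s(s+2)(2s+1)\bigl[12s(s+1)^2+(2s^4+s^3+s+2)\alpha^2\bigr]}
{(s^2+4s+1)\bigl[4s(s+1)+s^3\alpha^2\bigr]\bigl[4s(s+1)+\alpha^2\bigr]}.
\]
The limit of the ratio of $\tilde H(s)/H(s)$ to $\eta^{\rm Riem}_{\kappa_s}(\Phi)$ as $\alpha^2 = 1-\tau^2$ tends to 0 is
\[
\frac{3s(s+2)(2s+1)}{(s^2+4s+1)(s+1)^2}.
\]
The numerator minus the denominator of the above ratio is $-(s^4-5s^2+1)$, which is positive when $s^2 > (5-\sqrt{21})/2$, i.e., $s > (\sqrt{7}-\sqrt{3})/2 \approx 0.457$. This yields the second assertion of the theorem. QED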
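The algebra behind both assertions can be checked numerically. The sketch below is an illustration under stated assumptions, not the authors' computation: it takes $\xi_s(t) = (1+s)^2-(1-s)^2t$ (as read off from the matrix entries above), evaluates (74), (76) and (77) at $w_1 = 1$, and compares with the closed forms. All function names and sample parameter values are hypothetical.

```python
import math

def xi(s, t):
    """Assumed form xi_s(t) = (1+s)^2 - (1-s)^2 t."""
    return (1 + s) ** 2 - (1 - s) ** 2 * t

def h_base(s):
    """H(s): sum of the two expressions in (74) at w1 = 1."""
    return 1 / ((1 + s) ** 2 - 1) + 1 / ((1 + s) ** 2 - s ** 2)

def h_channel(s, a2, t2):
    """H~(s): sum of (76) and (77) at w1 = 1; a2 = alpha^2, t2 = tau^2."""
    x = xi(s, t2)
    total = 0.0
    for c in (a2, s * s * a2):   # (76) subtracts alpha^2, (77) subtracts s^2 alpha^2
        d = x - c
        total += a2 * (d - 4 * s * t2) / (d * (d - 4 * s * t2) - 4 * s * s * a2 * t2)
    return total

def closed_ratio(s, a2):
    """Closed form of H~(s)/H(s) when alpha^2 + tau^2 = 1."""
    num = a2 * s * (s + 2) * (2 * s + 1) * (
        12 * s * (s + 1) ** 2 + (2 * s**4 + s**3 + s + 2) * a2
    )
    den = (s * s + 4 * s + 1) * (4 * s * (s + 1) + s**3 * a2) * (4 * s * (s + 1) + a2)
    return num / den

def limit_ratio(s):
    """Limiting ratio 3s(s+2)(2s+1) / ((s^2+4s+1)(s+1)^2)."""
    return 3 * s * (s + 2) * (2 * s + 1) / ((s * s + 4 * s + 1) * (s + 1) ** 2)

# Hypothetical sample point with alpha^2 + tau^2 = 1 and 4 tau^2 > (1-alpha^2)(4-alpha^2).
A2, T2 = 0.3, 0.7
THRESHOLD = (math.sqrt(7) - math.sqrt(3)) / 2   # root of s^4 - 5 s^2 + 1 in (0, 1)
```

For these sample values, `h_channel(1.0, A2, T2) / h_base(1.0)` exceeds `A2`, and `limit_ratio` crosses 1 at `THRESHOLD`.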


B.4 Useful results 
B.4.1 Basic formulas 

We observe that any Hermitian matrix $A$ with $\mathrm{Tr}\,A = 0$ can be written as $A = \mathbf{y}\cdot\sigma$ with $\mathbf{y} \in \mathbb{R}^3$, and that $\mathrm{Tr}\,(aI+\mathbf{w}\cdot\sigma) = 2a$.

The following formulas will be useful: 


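\[
(aI+\mathbf{w}\cdot\sigma)(bI+\mathbf{y}\cdot\sigma) = (ab+\mathbf{w}\cdot\mathbf{y})I+(a\mathbf{y}+b\mathbf{w}+i\,\mathbf{w}\times\mathbf{y})\cdot\sigma,
\tag{78}
\]
\[
(aI+\mathbf{w}\cdot\sigma)^{-1} = \frac{1}{a^2-|\mathbf{w}|^2}\,(aI-\mathbf{w}\cdot\sigma),
\tag{79}
\]
\[
(bI+\mathbf{w}\cdot\sigma)^{1/2} = \sqrt{\frac{C(b,\mathbf{w})}{2}}\left[I+\frac{\mathbf{w}\cdot\sigma}{C(b,\mathbf{w})}\right],
\tag{80}
\]
where $C(b,\mathbf{w}) = b+\sqrt{b^2-|\mathbf{w}|^2}$.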
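Formulas (78)-(80) are easy to confirm with explicit $2\times 2$ matrices. The following is a small sketch in plain Python; the helper names (`pauli`, `matmul`, `check_78`, and so on) and the sample values are hypothetical and chosen only for illustration.

```python
import math

def pauli(a, w):
    """2x2 matrix a*I + w.sigma for a scalar a and a 3-vector w (entries may be complex)."""
    w1, w2, w3 = w
    return [[a + w3, w1 - 1j * w2], [w1 + 1j * w2, a - w3]]

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)] for i in range(2)]

def dot(u, v):
    return sum(p * q for p, q in zip(u, v))

def cross(u, v):
    return [u[1] * v[2] - u[2] * v[1], u[2] * v[0] - u[0] * v[2], u[0] * v[1] - u[1] * v[0]]

def max_diff(A, B):
    return max(abs(A[i][j] - B[i][j]) for i in range(2) for j in range(2))

def check_78(a, w, b, y):
    """Deviation between the two sides of the product formula (78)."""
    wxy = cross(w, y)
    vec = [a * y[k] + b * w[k] + 1j * wxy[k] for k in range(3)]
    return max_diff(matmul(pauli(a, w), pauli(b, y)), pauli(a * b + dot(w, y), vec))

def check_79(a, w):
    """Deviation of (aI + w.sigma) times the claimed inverse (79) from the identity."""
    n = a * a - dot(w, w)
    inverse = pauli(a / n, [-c / n for c in w])
    return max_diff(matmul(pauli(a, w), inverse), pauli(1.0, [0.0, 0.0, 0.0]))

def check_80(b, w):
    """Deviation of the square of the claimed square root (80) from bI + w.sigma."""
    C = b + math.sqrt(b * b - dot(w, w))
    root = [[math.sqrt(C / 2) * e for e in row] for row in pauli(1.0, [c / C for c in w])]
    return max_diff(matmul(root, root), pauli(b, w))

W = [0.3, -0.4, 0.5]   # hypothetical sample vectors
Y = [0.2, 0.7, -0.1]
```

Each check returns the largest entrywise deviation, which should be at the level of floating-point rounding.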

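It will be convenient to use the physicists' bra and ket notation for vectors in $\mathbb{R}^3$ as well as for their complex counterparts, in which $|\mathbf{x}\rangle\langle\mathbf{x}|$ denotes $|\mathbf{x}|^2$ times the projection onto $\mathbf{x}$; more generally, $|\mathbf{w}\rangle\langle\mathbf{x}| : \mathbf{y} \mapsto (\mathbf{x}\cdot\mathbf{y})\mathbf{w}$. In that notation, if $a \ne 0$ and $a \ne b|\mathbf{w}|^2$, then
\[
\bigl(aI-b|\mathbf{w}\rangle\langle\mathbf{w}|\bigr)^{-1} = a^{-1}\left[I+\frac{b\,|\mathbf{w}\rangle\langle\mathbf{w}|}{a-b|\mathbf{w}|^2}\right].
\tag{81}
\]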

The following lemmas are useful in Sections B.1-B.3 to prove the theorems of Section 6. 
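Lemma B.1. Let $A$, $B$ be positive linear operators on $\mathbb{R}^3$. Then
\[
\sup_{\mathbf{y}}\frac{\langle T\mathbf{y},A^{-1}T\mathbf{y}\rangle}{\langle\mathbf{y},B^{-1}\mathbf{y}\rangle}
= \sup_{\mathbf{y}}\frac{\langle T^*\mathbf{y},BT^*\mathbf{y}\rangle}{\langle\mathbf{y},A\mathbf{y}\rangle}.
\]
Proof. Writing $\mathbf{z} = B^{-1/2}\mathbf{y}$ one finds
\[
\sup_{\mathbf{y}}\frac{\langle T\mathbf{y},A^{-1}T\mathbf{y}\rangle}{\langle\mathbf{y},B^{-1}\mathbf{y}\rangle}
= \sup_{\mathbf{z}}\frac{\langle B^{1/2}\mathbf{z},T^*A^{-1}TB^{1/2}\mathbf{z}\rangle}{\langle\mathbf{z},\mathbf{z}\rangle}
= \|B^{1/2}T^*A^{-1}TB^{1/2}\|
\]
\[
= \|A^{-1/2}TBT^*A^{-1/2}\|
= \sup_{\mathbf{z}}\frac{\langle A^{-1/2}\mathbf{z},TBT^*A^{-1/2}\mathbf{z}\rangle}{\langle\mathbf{z},\mathbf{z}\rangle},
\]
where we have used the fact that $\|X^*X\| = \|XX^*\|$ with $X = A^{-1/2}TB^{1/2}$. Then redefining $\mathbf{y} = A^{-1/2}\mathbf{z}$ gives the desired result. QED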


Lemma B.2. For any fixed $\mu > 0$, $\nu > 0$ and $\mathbf{w}$, the minimum of
\[
F(y_2,y_3) = \mu(1+y_2^2+y_3^2)+\nu(w_1+w_2y_2+w_3y_3)^2
\]
is
\[
\min_{y_2,y_3}F(y_2,y_3) = \frac{\mu(\mu+\nu|\mathbf{w}|^2)}{\mu+\nu(|\mathbf{w}|^2-w_1^2)}.
\]


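Proof. The condition that $\nabla F = 0$ yields two linear equations which can be written in the form
\[
\begin{pmatrix}\mu+\nu w_2^2 & \nu w_2w_3\\ \nu w_2w_3 & \mu+\nu w_3^2\end{pmatrix}
\begin{pmatrix}y_2\\ y_3\end{pmatrix}
= -\nu w_1\begin{pmatrix}w_2\\ w_3\end{pmatrix}.
\]
This has the solution
\[
\begin{pmatrix}y_2\\ y_3\end{pmatrix}
= \frac{-\nu w_1}{\det}
\begin{pmatrix}\mu+\nu w_3^2 & -\nu w_2w_3\\ -\nu w_2w_3 & \mu+\nu w_2^2\end{pmatrix}
\begin{pmatrix}w_2\\ w_3\end{pmatrix}
= \frac{-\mu\nu w_1}{\det}\begin{pmatrix}w_2\\ w_3\end{pmatrix}
= \frac{-\nu w_1}{\mu+\nu(w_2^2+w_3^2)}\begin{pmatrix}w_2\\ w_3\end{pmatrix},
\]
since $\det = \mu^2+\mu\nu(w_2^2+w_3^2)$. It is now easy to compute the value of $F$ at this solution as
\[
\frac{\mu[\mu+\nu(w_2^2+w_3^2)]^2+\mu\nu^2w_1^2(w_2^2+w_3^2)+\mu^2\nu w_1^2}{[\mu+\nu(w_2^2+w_3^2)]^2}
= \frac{\mu[\mu+\nu(w_2^2+w_3^2)]+\mu\nu w_1^2}{\mu+\nu(w_2^2+w_3^2)}
= \frac{\mu(\mu+\nu|\mathbf{w}|^2)}{\mu+\nu(|\mathbf{w}|^2-w_1^2)}.
\]
QED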
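As a quick numerical illustration of Lemma B.2 (a sketch with hypothetical function names and sample values, not part of the proof), one can evaluate $F$ at the stated critical point and compare with the closed-form minimum, and also confirm that nearby points give larger values.

```python
def F(y2, y3, mu, nu, w):
    """F(y2, y3) = mu (1 + y2^2 + y3^2) + nu (w1 + w2 y2 + w3 y3)^2."""
    return mu * (1 + y2 * y2 + y3 * y3) + nu * (w[0] + w[1] * y2 + w[2] * y3) ** 2

def minimizer(mu, nu, w):
    """Critical point (y2, y3) = -nu w1 / (mu + nu (w2^2 + w3^2)) * (w2, w3)."""
    c = -nu * w[0] / (mu + nu * (w[1] ** 2 + w[2] ** 2))
    return c * w[1], c * w[2]

def min_value(mu, nu, w):
    """Closed-form minimum mu (mu + nu |w|^2) / (mu + nu (|w|^2 - w1^2))."""
    n2 = sum(c * c for c in w)
    return mu * (mu + nu * n2) / (mu + nu * (n2 - w[0] ** 2))

MU, NU, W = 1.3, 0.7, (0.4, -0.6, 0.5)   # hypothetical sample values
Y2_STAR, Y3_STAR = minimizer(MU, NU, W)
```

Since $F$ is a convex quadratic for $\mu, \nu > 0$, the critical point is the global minimum.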


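Lemma B.3. Let $P = I+\mathbf{w}\cdot\sigma$ with $|\mathbf{w}| < 1$. Then for every $\mathbf{y} \in \mathbb{R}^3$,
\[
\frac{2}{L_P+R_P}(\mathbf{y}\cdot\sigma)
= -\frac{\mathbf{w}\cdot\mathbf{y}}{1-|\mathbf{w}|^2}\,I+\left[\mathbf{y}+\frac{(\mathbf{w}\cdot\mathbf{y})}{1-|\mathbf{w}|^2}\,\mathbf{w}\right]\cdot\sigma.
\tag{82}
\]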


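Proof. Write $\beta I+\mathbf{z}\cdot\sigma = 2(L_P+R_P)^{-1}(\mathbf{y}\cdot\sigma)$. Then (78) and $\mathbf{z}\times\mathbf{w} = -\mathbf{w}\times\mathbf{z}$ imply that
\[
\mathbf{y}\cdot\sigma = \tfrac12(I+\mathbf{w}\cdot\sigma)(\beta I+\mathbf{z}\cdot\sigma)+\tfrac12(\beta I+\mathbf{z}\cdot\sigma)(I+\mathbf{w}\cdot\sigma)
= (\beta+\mathbf{z}\cdot\mathbf{w})I+(\mathbf{z}+\beta\mathbf{w})\cdot\sigma.
\]
Since $I$ and the Pauli matrices form a basis for $M_2$, this implies $\beta = -\mathbf{z}\cdot\mathbf{w}$ and $\mathbf{y} = \mathbf{z}+\beta\mathbf{w}$, so that
\[
\mathbf{y} = \mathbf{z}-(\mathbf{z}\cdot\mathbf{w})\mathbf{w} = (I-|\mathbf{w}\rangle\langle\mathbf{w}|)\mathbf{z}.
\]
Then using (81) with $a = b = 1$ we find that
\[
\mathbf{z} = \left[I+\frac{|\mathbf{w}\rangle\langle\mathbf{w}|}{1-|\mathbf{w}|^2}\right]\mathbf{y}
= \mathbf{y}+\frac{(\mathbf{w}\cdot\mathbf{y})\mathbf{w}}{1-|\mathbf{w}|^2}.
\]
Inserting this into $\beta = -\mathbf{z}\cdot\mathbf{w}$ yields
\[
\beta = -\frac{\mathbf{w}\cdot\mathbf{y}}{1-|\mathbf{w}|^2}.
\]
QED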
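Formula (82) can be confirmed by applying $\tfrac12(L_P+R_P)$ to its right-hand side and checking that $\mathbf{y}\cdot\sigma$ is recovered. The sketch below does this with explicit $2\times 2$ matrices; the helper names and sample vectors are hypothetical.

```python
def pauli(a, w):
    """2x2 matrix a*I + w.sigma."""
    w1, w2, w3 = w
    return [[a + w3, w1 - 1j * w2], [w1 + 1j * w2, a - w3]]

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)] for i in range(2)]

def dot(u, v):
    return sum(p * q for p, q in zip(u, v))

def lemma_b3_rhs(w, y):
    """Coefficients (beta, z) of the right-hand side of (82)."""
    wy = dot(w, y)
    n = 1.0 - dot(w, w)
    beta = -wy / n
    z = [y[k] + wy * w[k] / n for k in range(3)]
    return beta, z

def residual_82(w, y):
    """Largest entry of (P X + X P)/2 - y.sigma, where X is the RHS of (82)."""
    P = pauli(1.0, w)
    beta, z = lemma_b3_rhs(w, y)
    X = pauli(beta, z)
    PX, XP = matmul(P, X), matmul(X, P)
    target = pauli(0.0, y)
    return max(abs(0.5 * (PX[i][j] + XP[i][j]) - target[i][j])
               for i in range(2) for j in range(2))

W = [0.3, -0.4, 0.5]   # hypothetical sample with |w| < 1
Y = [0.2, 0.7, -0.1]
```

A residual at rounding level indicates that the right-hand side of (82) is indeed mapped back to $\mathbf{y}\cdot\sigma$.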


B.4.2 Lemmas for extreme points 

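Lemma B.4. Let $0 \le s \le 1$, $P = I+\mathbf{w}\cdot\sigma$ and $Q = I+\mathbf{x}\cdot\sigma$ with $|\mathbf{w}|, |\mathbf{x}| < 1$. Let $\mathbf{u} = \mathbf{w}-s\mathbf{x}$ and $\mathbf{v} = \mathbf{w}+s\mathbf{x}$. Then for every $\mathbf{y} \in \mathbb{R}^3$,
\[
\mathrm{Tr}\,(\mathbf{y}\cdot\sigma)\,\frac{1+s}{L_P+sR_Q}\,(\mathbf{y}\cdot\sigma)
= 2(1+s)^2\Bigl\langle\mathbf{y},\Bigl[\{(1+s)^2-|\mathbf{u}|^2\}I+|\mathbf{u}\rangle\langle\mathbf{u}|-|\mathbf{v}\rangle\langle\mathbf{v}|-\frac{|\mathbf{u}\times\mathbf{v}\rangle\langle\mathbf{u}\times\mathbf{v}|}{(1+s)^2-|\mathbf{v}|^2}\Bigr]^{-1}\mathbf{y}\Bigr\rangle,
\tag{83}
\]
where the operator inside $[\ ]^{-1}$ of (83) is positive and invertible.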

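Proof. As above, let $\beta I+\mathbf{z}\cdot\sigma = (L_P+sR_Q)^{-1}(\mathbf{y}\cdot\sigma)$ so that
\[
\mathbf{y}\cdot\sigma = (L_P+sR_Q)(\beta I+\mathbf{z}\cdot\sigma)
= [(1+s)\beta+\mathbf{z}\cdot(\mathbf{w}+s\mathbf{x})]I+[(1+s)\mathbf{z}+\beta(\mathbf{w}+s\mathbf{x})-i\,\mathbf{z}\times(\mathbf{w}-s\mathbf{x})]\cdot\sigma
\]
\[
= [(1+s)\beta+\mathbf{z}\cdot\mathbf{v}]I+[(1+s)\mathbf{z}+\beta\mathbf{v}-i\,\mathbf{z}\times\mathbf{u}]\cdot\sigma.
\tag{84}
\]
Since the coefficient of $I$ on the right side of (84) must be 0, we find
\[
\beta = -\frac{\mathbf{z}\cdot\mathbf{v}}{1+s}.
\]
Inserting this into (84) and equating real and imaginary parts yield
\[
\mathbf{y} = \Bigl[(1+s)I-\frac{|\mathbf{v}\rangle\langle\mathbf{v}|}{1+s}\Bigr]\mathbf{z}_1+\mathbf{z}_2\times\mathbf{u},
\tag{85a}
\]
\[
0 = \Bigl[(1+s)I-\frac{|\mathbf{v}\rangle\langle\mathbf{v}|}{1+s}\Bigr]\mathbf{z}_2-\mathbf{z}_1\times\mathbf{u},
\tag{85b}
\]
where we have written $\mathbf{z} = \mathbf{z}_1+i\mathbf{z}_2$ and $(\mathbf{z}\cdot\mathbf{v})\mathbf{v} = |\mathbf{v}\rangle\langle\mathbf{v}|\mathbf{z}$. Solving (85b) for $\mathbf{z}_2$ with use of (81) yields
\[
\mathbf{z}_2 = \frac{1}{1+s}\Bigl[I+\frac{|\mathbf{v}\rangle\langle\mathbf{v}|}{(1+s)^2-|\mathbf{v}|^2}\Bigr](\mathbf{z}_1\times\mathbf{u}).
\]
Inserting this into (85a) gives
\[
\mathbf{y} = \frac{1}{1+s}\Bigl[\{(1+s)^2-|\mathbf{u}|^2\}I+|\mathbf{u}\rangle\langle\mathbf{u}|-|\mathbf{v}\rangle\langle\mathbf{v}|-\frac{|\mathbf{u}\times\mathbf{v}\rangle\langle\mathbf{u}\times\mathbf{v}|}{(1+s)^2-|\mathbf{v}|^2}\Bigr]\mathbf{z}_1,
\tag{86}
\]
where we have used
\[
(\mathbf{z}_1\times\mathbf{u})\times\mathbf{u} = -(\mathbf{u}\cdot\mathbf{u})\mathbf{z}_1+(\mathbf{u}\cdot\mathbf{z}_1)\mathbf{u},
\]
\[
[|\mathbf{v}\rangle\langle\mathbf{v}|(\mathbf{z}_1\times\mathbf{u})]\times\mathbf{u} = [\mathbf{v}\cdot(\mathbf{z}_1\times\mathbf{u})]\,\mathbf{v}\times\mathbf{u} = -|\mathbf{v}\times\mathbf{u}\rangle\langle\mathbf{v}\times\mathbf{u}|\,\mathbf{z}_1
\]
in the first and the second terms from $\mathbf{z}_2$, respectively. Thus we have proved that for every $\mathbf{y} \in \mathbb{R}^3$ there exists a $\mathbf{z}_1 \in \mathbb{R}^3$ satisfying (86). This implies that the operator inside $[\ ]$ of (86) is surjective and hence invertible. Since
\[
\mathrm{Tr}\,(\mathbf{y}\cdot\sigma)\,\frac{1}{L_P+sR_Q}\,(\mathbf{y}\cdot\sigma) = \mathrm{Tr}\,(\mathbf{y}\cdot\sigma)(\beta I+\mathbf{z}\cdot\sigma) = 2\,\mathbf{y}\cdot\mathbf{z}_1,
\]
we obtain (83) by solving for $\mathbf{z}_1$ in (86). Moreover, since the LHS of (83) is $\ge 0$, the operator inside $[\ ]^{-1}$ is indeed positive. QED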

In the case where $\mathbf{y} = \mathbf{w}-\mathbf{x} \in \mathrm{span}\{\mathbf{u},\mathbf{v}\}$ and so $\mathbf{y}$ is orthogonal to $\mathbf{u}\times\mathbf{v}$, we can simplify the expression above.

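Lemma B.5. In the notation of Lemma B.4, when $\mathbf{y}$ is orthogonal to $\mathbf{u}\times\mathbf{v}$, the right-hand side of equation (83) becomes
\[
2(1+s)^2\Bigl\langle\mathbf{y},\bigl[\{(1+s)^2-|\mathbf{u}|^2\}I+|\mathbf{u}\rangle\langle\mathbf{u}|-|\mathbf{v}\rangle\langle\mathbf{v}|\bigr]^{-1}\mathbf{y}\Bigr\rangle.
\tag{87}
\]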

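Proof. First note that when $X \ge 0$ and $I-W-X > 0$ is invertible, then $I-W > 0$ is also invertible. Then it suffices to observe that when $WX = 0$ and $X\mathbf{y} = 0$,
\[
(I-W-X)^{-1}\mathbf{y} = \sum_{k=0}^{\infty}(W+X)^k\mathbf{y} = \sum_{k=0}^{\infty}W^k\mathbf{y} = (I-W)^{-1}\mathbf{y},
\]
and apply this with
\[
W = \frac{|\mathbf{v}\rangle\langle\mathbf{v}|-|\mathbf{u}\rangle\langle\mathbf{u}|}{(1+s)^2-|\mathbf{u}|^2},
\qquad
X = \frac{|\mathbf{u}\times\mathbf{v}\rangle\langle\mathbf{u}\times\mathbf{v}|}{[(1+s)^2-|\mathbf{u}|^2]\,[(1+s)^2-|\mathbf{v}|^2]}.
\]
QED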
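Lemmas B.4 and B.5 can both be checked numerically. The sketch below (hypothetical helper names and sample data, plain Python with Cramer's rule for the $3\times 3$ solve) builds $\mathbf{z}_1$ from (86), $\mathbf{z}_2$ and $\beta$ from the proof of Lemma B.4, and verifies that $P X + s X Q = \mathbf{y}\cdot\sigma$ for $X = \beta I + \mathbf{z}\cdot\sigma$. It then compares the quadratic forms in (83) and (87) for $\mathbf{y} = \mathbf{w}-\mathbf{x}$.

```python
def pauli(a, w):
    """2x2 matrix a*I + w.sigma (entries may be complex)."""
    w1, w2, w3 = w
    return [[a + w3, w1 - 1j * w2], [w1 + 1j * w2, a - w3]]

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)] for i in range(2)]

def dot(u, v):
    return sum(p * q for p, q in zip(u, v))

def cross(u, v):
    return [u[1] * v[2] - u[2] * v[1], u[2] * v[0] - u[0] * v[2], u[0] * v[1] - u[1] * v[0]]

def det3(M):
    return (M[0][0] * (M[1][1] * M[2][2] - M[1][2] * M[2][1])
            - M[0][1] * (M[1][0] * M[2][2] - M[1][2] * M[2][0])
            + M[0][2] * (M[1][0] * M[2][1] - M[1][1] * M[2][0]))

def solve3(M, y):
    """Solve M t = y for a 3x3 matrix M by Cramer's rule."""
    d = det3(M)
    return [det3([[y[i] if j == col else M[i][j] for j in range(3)] for i in range(3)]) / d
            for col in range(3)]

def uv(s, w, x):
    return [w[k] - s * x[k] for k in range(3)], [w[k] + s * x[k] for k in range(3)]

def bracket(s, u, v, with_cross=True):
    """Operator inside [ ] of (86); with_cross=False drops the |u x v><u x v| term as in (87)."""
    c = (1 + s) ** 2 - dot(u, u)
    g = (1 + s) ** 2 - dot(v, v)
    n = cross(u, v)
    return [[(c if i == j else 0.0) + u[i] * u[j] - v[i] * v[j]
             - (n[i] * n[j] / g if with_cross else 0.0)
             for j in range(3)] for i in range(3)]

def residual_86(s, w, x, y):
    """Largest entry of P X + s X Q - y.sigma, with X built from (86), (85b) and beta."""
    u, v = uv(s, w, x)
    z1 = [(1 + s) * c for c in solve3(bracket(s, u, v), y)]        # solve (86) for z1
    t = cross(z1, u)
    g = (1 + s) ** 2 - dot(v, v)
    z2 = [(t[k] + v[k] * dot(v, t) / g) / (1 + s) for k in range(3)]  # solution of (85b)
    z = [z1[k] + 1j * z2[k] for k in range(3)]
    X = pauli(-dot(z, v) / (1 + s), z)
    PX, XQ = matmul(pauli(1.0, w), X), matmul(X, pauli(1.0, x))
    target = pauli(0.0, y)
    return max(abs(PX[i][j] + s * XQ[i][j] - target[i][j]) for i in range(2) for j in range(2))

def quad(s, w, x, y, with_cross):
    """<y, [ ]^{-1} y> for the operator of (83) (with_cross=True) or (87) (False)."""
    u, v = uv(s, w, x)
    return dot(y, solve3(bracket(s, u, v, with_cross), y))

S, W, X0 = 0.5, [0.3, -0.2, 0.4], [-0.1, 0.5, 0.2]   # hypothetical sample data
Y_GENERIC = [0.7, -0.3, 0.2]
Y_DIFF = [W[k] - X0[k] for k in range(3)]            # y = w - x, orthogonal to u x v
```

For `Y_DIFF` the two quadratic forms agree, which is the content of Lemma B.5; for a generic vector they need not.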


Note that when $P = Q$ so that $\mathbf{u} = (1-s)\mathbf{w}$ and $\mathbf{v} = (1+s)\mathbf{w}$, the $s$-dependence of all terms in (87) has the form $(1\pm s)^2/(1+s)^2 = (1\pm s^{-1})^2/(1+s^{-1})^2$, which implies (68) for $A = \mathbf{y}\cdot\sigma$ as
\[
\mathrm{Tr}\,A\,\frac{1+s}{L_P+sR_P}\,A
= \mathrm{Tr}\,A\,\frac{1+s^{-1}}{L_P+s^{-1}R_P}\,A
= \mathrm{Tr}\,A\,\frac{1+s}{R_P+sL_P}\,A.
\]